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Foreword 


More than a generation of German-speaking students around the world have worked 
their way to an understanding and appreciation of the power and beauty of modern the¬ 
oretical physics—with mathematics, the most fundamental of sciences—using Walter 
Greiner’s textbooks as their guide. 

The idea of developing a coherent, complete presentation of an entire field of sci¬ 
ence in a series of closely related textbooks is not a new one. Many older physicians 
remember with real pleasure their sense of adventure and discovery as they worked 
their ways through the classic series by Sommerfeld, by Planck, and by Landau and 
Lifshitz. From the students’ viewpoint, there are a great many obvious advantages to 
be gained through the use of consistent notation, logical ordering of topics, and co¬ 
herence of presentation; beyond this, the complete coverage of the science provides a 
unique opportunity for the author to convey his personal enthusiasm and love for his 
subject. 

These volumes on classical physics, finally available in English, complement 
Greiner’s texts on quantum physics, most of which have been available to English- 
speaking audiences for some time. The complete set of books will thus provide a 
coherent view of physics that includes, in classical physics, thermodynamics and sta¬ 
tistical mechanics, classical dynamics, electromagnetism, and general relativity; and 
in quantum physics, quantum mechanics, symmetries, relativistic quantum mechanics, 
quantum electro- and chromodynamics, and the gauge theory of weak interactions. 

What makes Greiner’s volumes of particular value to the student and professor alike 
is their completeness. Greiner avoids the all too common “it follows that...,” which 
conceals several pages of mathematical manipulation and confounds the student. He 
does not hesitate to include experimental data to illuminate or illustrate a theoretical 
point, and these data, like the theoretical content, have been kept up to date and top¬ 
ical through frequent revision and expansion of the lecture notes upon which these 
volumes are based. 

Moreover, Greiner greatly increases the value of his presentation by including 
something like one hundred completely worked examples in each volume. Nothing is 
of greater importance to the student than seeing, in detail, how the theoretical concepts 
and tools under study are applied to actual problems of interest to working physicists. 
And, finally, Greiner adds brief biographical sketches to each chapter covering the 
people responsible for the development of the theoretical ideas and/or the experimen¬ 
tal data presented. It was Auguste Comte (1789-1857) in his Positive Philosophy who 
noted, “To understand a science it is necessary to know its history.” This is all too 
often forgotten in modern physics teaching, and the bridges that Greiner builds to the 
pioneering figures of our science upon whose work we build are welcome ones. 

Greiner’s lectures, which underlie these volumes, are internationally noted for their 
clarity, for their completeness, and for the effort that he has devoted to making physics 
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an integral whole. His enthusiasm for his sciences is contagious and shines through 
almost every page. 

These volumes represent only a part of a unique and Herculean effort to make all 
of theoretical physics accessible to the interested student. Beyond that, they are of 
enormous value to the professional physicist and to all others working with quantum 
phenomena. Again and again, the reader will find that, after dipping into a particular 
volume to review a specific topic, he or she will end up browsing, caught up by often 
fascinating new insights and developments with which he or she had not previously 
been familiar. 

Having used a number of Greiner’s volumes in their original German in my teach¬ 
ing and research at Yale, I welcome these new and revised English translations and 
would recommend them enthusiastically to anyone searching for a coherent overview 
of physics. 

Yale University D. Allan Bromley 

New Haven, Connecticut, USA Henry Ford II Professor of Physics 



Preface to the Second Edition 


I am pleased to note that our text Classical Mechanics: Systems of Particles and 
Hamiltonian Dynamics has found many friends among physics students and re¬ 
searchers, and that a second edition has become necessary. We have taken this op¬ 
portunity to make several amendments and improvements to the text. A number of 
misprints and minor errors have been corrected and explanatory remarks have been 
supplied at various places. 

New examples have been added in Chap. 19 on canonical transformations, dis¬ 
cussing the harmonic oscillator (19.3), the damped harmonic oscillator (19.4), infini¬ 
tesimal time steps as canonical transformations (19.5), the general form of Liouville’s 
theorem (19.6), the canonical invariance of the Poisson brackets (19.7), Poisson’s the¬ 
orem (19.8), and the invariants of the plane Kepler system (19.9). 

It may come as a surprise that even for a time-honored subject such as Clas¬ 
sical Mechanics in the formulation of Lagrange and Hamilton, new aspects may 
emerge. But this has indeed been the case, resulting in new chapters on the “Extended 
Hamilton-Lagrange formalism” (Chap. 21) and the “Extended Hamilton-Jacobi equa¬ 
tion” (Chap. 22). These topics are discussed here for the first time in a textbook, and 
we hope that they will help to convince students that even Classical Mechanics can 
still be an active area of ongoing research. 

I would especially like to thank Dr. Jurgen Struckmeier for his help in constructing 
the new chapters on the Extended Hamilton-Lagrange-Jacobi formalism, and Dr. Ste¬ 
fan Scherer for his help in the preparation of this new edition. Finally, I appreciate the 
agreeable collaboration with the team at Springer-Verlag, Heidelberg. 


Frankfurt am Main 
September 2009 


Walter Greiner 




Preface to the First Edition 


Theoretical physics has become a many faceted science. For the young student, it 
is difficult enough to cope with the overwhelming amount of new material that has 
to be learned, let alone obtain an overview of the entire field, which ranges from 
mechanics through electrodynamics, quantum mechanics, field theory, nuclear and 
heavy-ion science, statistical mechanics, thermodynamics, and solid-state theory to 
elementary-particle physics; and this knowledge should be acquired in just eight to ten 
semesters, during which, in addition, a diploma or master’s thesis has to be worked on 
or examinations prepared for. All this can be achieved only if the university teachers 
help to introduce the student to the new disciplines as early on as possible, in order to 
create interest and excitement that in turn set free essential new energy. 

At the Johann Wolfgang Goethe University in Frankfurt am Main, we therefore 
confront the student with theoretical physics immediately, in the first semester. The¬ 
oretical Mechanics I and II, Electrodynamics, and Quantum Mechanics I—An Intro¬ 
duction are the courses during the first two years. These lectures are supplemented 
with many mathematical explanations and much support material. After the fourth 
semester of studies, graduate work begins, and Quantum Mechanics II—Symmetries, 
Statistical Mechanics and Thermodynamics, Relativistic Quantum Mechanics, Quan¬ 
tum Electrodynamics, Gauge Theory of Weak Interactions, and Quantum Chromo¬ 
dynamics are obligatory. Apart from these, a number of supplementary courses on 
special topics are offered, such as Hydrodynamics, Classical Field Theory, Special 
and General Relativity, Many-Body Theories, Nuclear Models, Models of Elementary 
Particles, and Solid-State Theory. 

This volume of lectures, Classical Mechanics: Systems of Particles and Hamil¬ 
tonian Dynamics , deals with the second and more advanced part of the important field 
of classical mechanics. We have tried to present the subject in a manner that is both 
interesting to the student and easily accessible. The main text is therefore accompa¬ 
nied by many exercises and examples that have been worked out in great detail. This 
should make the book useful also for students wishing to study the subject on their 
own. 

Beginning the education in theoretical physics at the first university semester, and 
not as dictated by tradition after the first one and a half years in the third or fourth 
semester, has brought along quite a few changes as compared to the traditional courses 
in that discipline. Especially necessary is a greater amalgamation between the ac¬ 
tual physical problems and the necessary mathematics. Therefore, we treat in the first 
semester vector algebra and analysis, the solution of ordinary, linear differential equa¬ 
tions, Newton’s mechanics of a mass point, and the mathematically simple mechanics 
of special relativity. 

Many explicitly worked-out examples and exercises illustrate the new concepts 
and methods and deepen the interrelationship between physics and mathematics. As a 
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matter of fact, the first-semester course in theoretical mechanics is a precursor to the¬ 
oretical physics. This changes significantly the content of the lectures of the second 
semester addressed here. Theoretical mechanics is extended to systems of mass points, 
vibrating strings and membranes, rigid bodies, the spinning top, and the discussion of 
formal (analytical) aspects of mechanics, that is, Lagrange’s, Hamilton’s formalism, 
and Hamilton-Jacobi formulation of mechanics. Considered from the mathematical 
point of view, the new features are partial differential equations, Fourier expansion, 
and eigenvalue problems. These new tools are explained and exercised in many physi¬ 
cal examples. In the lecturing praxis, the deepening of the exhibited material is carried 
out in a three-hour-per-week theoretica , that is, group exercises where eight to ten stu¬ 
dents solve the given exercises under the guidance of a tutor. 

We have added some chapters on modern developments of nonlinear mechanics 
(dynamical systems, stability of time-dependent orbits, bifurcations, Lyapunov expo¬ 
nents and chaos, systems with chaotic dynamics), being well aware that all this mate¬ 
rial cannot be taught in a one-semester course. It is meant to stimulate interest in that 
field and to encourage the students’ further (private) studies. 

The last chapter is devoted to the history of mechanics. It also contains remarks on 
the lives and work of outstanding philosophers and scientists who contributed impor¬ 
tantly to the development of science in general and mechanics in particular. 

Biographical and historical footnotes anchor the scientific development within the 
general context of scientific progress and evolution. In this context, I thank the pub¬ 
lishers Harri Deutsch and F.A. Brockhaus ( Brockhaus Enzyklopadie , F.A. Brockhaus, 
Wiesbaden—marked by [BR]) for giving permission to extract the biographical data 
of physicists and mathematicians from their publications. 

We should also mention that in preparing some early sections and exercises of our 
lectures we relied on the book Theory and Problems of Theoretical Mechanics , by 
Murray R. Spiegel, McGraw-Hill, New York, 1967. 

Over the years, we enjoyed the help of several former students and collabo¬ 
rators, in particular, H. Angermuller, R Bergmann, H. Betz, W. Betz, G. Binnig, 
J. Briechle, M. Bundschuh, W. Caspar, C. v. Charewski, J. v. Czamecki, R. Fick- 
ler, R. Fiedler, B. Fricke, C. Greiner, M. Greiner, W. Grosch, R. Heuer, E. Hoff¬ 
mann, L. Kohaupt, N. Krug, R Kurowski, H. Leber, H.J. Lustig, A. Mahn, B. Moreth, 
R. Morschel, B. Muller, H. Muller, H. Peitz, G. Plunien, J. Rafelski, J. Reinhardt, 
M. Rufa, H. Schaller, D. Schebesta, H.J. Scheefer, H. Schwerin, M. Seiwert, G. Soff, 
M. Soffel, E. Stein, K.E. Stiebing, E. Stammler, H. Stock, J. Wagner, and R. Zim- 
mermann. They all made their way in science and society, and meanwhile work as 
professors at universities, as leaders in industry, and in other places. We particularly 
acknowledge the recent help of Dr. Sven Soff during the preparation of the English 
manuscript. The figures were drawn by Mrs. A. Steidl. 

The English manuscript was copy-edited by Heather Jones, and the production of 
the book was supervised by Francine McNeill of Springer-Verlag New York, Inc. 


Johann Wolfgang Goethe-Universitat 
Frankfurt am Main, Germany 


Walter Greiner 
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Newton's Equations in a Rotating 
Coordinate System 


In classical mechanics, Newton’s laws hold in all systems moving uniformly relative 
to each other (i.e., inertial systems) if they hold in one system. However, this is no 
longer valid if a system undergoes accelerations. The new relations are obtained by 
establishing the equations of motion in a fixed system and transforming them into the 
accelerated system. 

We first consider the rotation of a coordinate system (V, /, z f ) about the origin 
of the inertial system (x,y,z) where the two coordinate origins coincide. The inertial 
system is denoted by L (“laboratory system”) and the rotating system by M (“moving 
system”). 



Fig. 1.1. Relative position of 
the coordinate systems x,y,z 
and x', y', z' 


In the primed system the vector A (t) = A[e[ + A' 2 e 2 + A' 3 e ' 3 changes with time. 
For an observer resting in this system this can be represented as follows: 

dA dA\ dA~ dA\ 

— = -^e + ~r^ e 2 + 

dt M dt dt dt 

The index M means that the derivative is being calculated from the moving system. 
In the inertial system (x, y, z) A is also time dependent. Because of the rotation of the 
primed system the unit vectors , e 2 , e ' 3 also vary with time; i.e., when differentiating 
the vector A from the inertial system, the unit vectors must be differentiated too: 

dA 

dt 


+ AjCi + A 2 e 2 + A 3 e 3 . 

M 


dA 

dt 


dA, . dA~ . dA\ . 

— — —e 1 H———e 2 H———+ A 1 e 1 + A 2 e 2 + A 3 e 3 
r dt 1 dt dt * 11 z z J J 


Generally the following holds: (d/dt)(e' y • e y ) = e' y • e r y +e r K • e y = (d/dt)(l) = 0. 
Hence, e y • e' y =0. The derivative of a unit vector e y is always orthogonal to the vector 
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1 Newton's Equations in a Rotating Coordinate System 


itself. Therefore the derivative of a unit vector can be written as a linear combination 
of the two other unit vectors: 

= a \ e ' 2 + <22^3, 

= T $ 463 , 

£3 = <2561 +a 6 e' 2 . 

Only 3 of these 6 coefficients are independent. To show this, we first differentiate 
= 0 , and obtain 


Multiplying = a\e 2 + < 22^3 by e 2 and correspondingly e 2 = <23 + <2463 by e^, 

one obtains 


e 2 . = a\ and e\ -e 2 = a 3 , 


and hence <23 = —a \. Analogously one finds <25 = —<24 and <25 = —< 22 . 

The derivative of the vector A in the inertial system can now be written as follows: 


dA 

dt 


dA 

dt 

dA 

dt 


T A^ (ci\fz 2 + ^ 2 ^ 3 ) T ^ 2 ^—T * 2463 ) T - A 3 (—$ 2^1 — $ 4 © 2 ) 


M 


T cj (—A^ — $ 2 A 3 ) H- — CL 4 . A 3 ) T ($ 2 A^ T ci 4 A 2 ). 


M 


From the evaluation rule for the vector product, 

C X A: 

= e[(C 2 A' 3 - C 3 A' 2 ) - e' 2 (CiA' 3 - C 3 A[) + e' (CiA^ - C 2 A [), 


1 ^2 c 3 

Cl C 2 C 3 

a; a' a ' 3 


it follows by setting C = (< 24 , —a 2 , a\) that 


dA 


dt 


L 


dA 

dt 


+ C x A. 

M 


We still have to show the physical meaning of the vector C. For this purpose we con¬ 
sider the special case dA/dt\M = 0; i.e., the derivative of the vector A in the moving 
system vanishes. A moves (rotates) with the moving system; it is tightly “mounted” 
in the system. Let cp be the angle between the axis of rotation (in our special case the 
z-axis) and A. The component parallel to the angular velocity co is not changed by the 
rotation. 

The change of A in the laboratory system is then given by 


dA = codtAsirup or 


dA 

dt 


= coA sirup. 
L 


This can also be written as 


dA 

dt 


= co x A. 

L 
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Fig. 1.2. Change of an arbi¬ 
trary vector A tightly fixed to 
a rotating system 


The orientation of (co x A )dt also coincides with dA (see Fig. 1.2). Since the 
(fixed) vector A can be chosen arbitrarily, the vector C must be identical with the 
angular velocity co of the rotating system M. By insertion we obtain 


dA 


dt 


L 


dA 

dt 


+ co x A. 

M 


( 1 . 1 ) 


This can also be seen as follows (see Fig. 1.3): If the rotational axis of the primed 
system coincides during a time interval dt with one of the coordinate axes of the 
nonprimed system, e.g., co = (pe 3 , then 


= cpe 2 and e' 2 = —<pe [, 


i.e., 


a\=<p, <22 = <24 = 0 , and hence C = (pe' 3 =co. 



Fig. 1.3. \dt[\ = \de' 2 \ = 
(p • dt 


In the general case co = co\e\ + 002^2 + ^> 3 ^ 3 , one decomposes co — ^co[ with 
coi = coiet , and by the preceding consideration one finds 

Cf = C 0 i ; i.e., C = £] C ,• = £«,•= «. 

i i 
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1 Newton's Equations in a Rotating Coordinate System 


1.1 Introduction of the Operator!) 

To shorten the expression dF(x,... ,t)/dt = dF/dt, we introduce the operator D = 
d/dt. The inertial system and the accelerated system will be distinguished by the in¬ 
dices L and M, so that 

^ d ^ d 

Dl = — and Dm = — 

at L at M 

The equation 

dA 

= — + co x A 

L dt M 

then simplifies to 

Dl A = Dm A -\-co x A. 

If the vector A is omitted, the equation is called an operator equation 
Dl = D m +(ox, 

which can operate on arbitrary vectors. 

EXAMPLE 

1.1 Angular Velocity Vector co 

dco 

= — + (O X (O. 

L dt M 

Since co x co = 0, it follows that 

dco dco 
dt dt m 

These two derivatives are evidently identical for all vectors that are parallel to the 
rotational plane, since then the vector product vanishes. 

EXAMPLE 

1.2 Position Vector r 

dr dr 

dt L dt 

in operator notation this becomes 
D^r = Dy/r + co x r, 


+ co x r, 

M 


dco 

dt 


dA 

dt 


1.2 Formulation of Newton's Equation in the Rotating Coordinate System 
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where ( dr/dt)\M is called the virtual velocity and (dr/dt )\m Coo x r the true velocity. 
The term oo x r is called the rotational velocity. 


1.2 Formulation of Newton's Equation in the Rotating Coordinate 
System 


Newton’s law mr = F holds only in the inertial system. In accelerated systems, there 
appear additional terms. First we consider again a pure rotation. 

For the acceleration we have 

d - - - - 

r L = — (r) L = D l (D l y) = (% + wx)(%r + w x r) 
at 

= D 2 m y + Dm (to x r) + co x + co x (oo x r) 

= D 2 m y + ( Dmoo ) x r + 2co x + wx(wxr). 


We replace the operator by the differential quotient: 

+ oo x (o> x r). 


d 2 Y 

dt 2 


d 2 r 

d £ 2 


M 


dco 

+ -y - 
dt 


M 


dY 

x r + 2oo x — 
dt 


( 1 . 2 ) 


M 


The expression (< doo/dt)\M x r is called the linear acceleration, 2 co x ( dY/dt)\M 
the Coriolis acceleration , and w x x r) the centripetal acceleration. 
Multiplication by the mass m yields the force F: 


m 


d 2 Y 

dt 2 


doo 

dY 

-Cm—— 

x r + 2 moo x — 

m “t 

M dt 


+ moo x(wxr)=F. 


M 


The basic equation of mechanics in the rotating coordinate system therefore reads 
(with the index M being omitted): 


d 2 Y doo 

m = F — in x r — 2moo x v — moo x (cuxr). 


dt 2 


dt 


(1.3) 


The additional terms on the right-hand side of (1.3) are virtual forces of a dynamical 
nature, but actually they are due to the acceleration term. For experiments on the earth 
the additional terms can often be neglected, since the angular velocity of the earth 
w = 2 jt/T ( T = 24 h) is only 7.27 • 10 “ 5 s~ l . 


1.3 Newton's Equations in Systems with Arbitrary Relative Motion 

We now drop the condition that the origins of the two coordinate systems coincide. 
The general motion of a coordinate system is composed of a rotation of the system 
and a translation of the origin. If R points to the origin of the primed system, then the 
position vector in the nonprimed system is r = R + y' . 


Example 1.2 
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1 Newton's Equations in a Rotating Coordinate System 


Fig. 1.4. Relative position of 
the coordinate systems x,y,z 
and x',y', z' 



For the velocity we have r = R + r f , and in the inertial system we have as before 
d 2 r 


m 


dt 2 


F. 


By inserting r and differentiating, we obtain 


m 


d 2 r' 
dt 2 


+ m 


d 2 R 
dt 2 


= F. 


The transition to the accelerated system is performed as above (see (1.3)), but here 
we still have the additional term mR: 


dV 

dt 2 


= F — m 


M 


d 2 R 
dt 2 


dco 

i — 

dt 


x r f — 2mco xvm- moo x (co x r f ). (1.4) 


M 












Free Fall on the Rotating Earth 


2 


On the earth, the previously derived form of the basic equation of mechanics holds if 
we neglect the rotation about the sun and therefore consider a coordinate system at the 
earth center as an inertial system. 

mr'\M = F — wR|l — moo x y'\m — 2 moo x y'\m — moo x (oo x r'). (2.1) 

The rotational velocity oo of the earth about its axis can be considered constant in time; 
therefore, moo x r ; = 0 . 

The motion of the point R, i.e., the motion of the coordinate origin of the system 
(.x', y, z'), still has to be recalculated in the moving system. According to (2.1), we 
have 

R|l = R| m + 6 >l m x R + 2 oo x R|m +oo x (oo x R). 

Since R as seen from the moving system is a time-independent quantity and since 
oo is constant, this equation finally reads 

R| l =00 x (oo x R). 

This is the centripetal acceleration due to the earth’s rotation that acts on a body 
moving on the earth’s surface. For the force equation (2.1) one gets 

mr' = F — moo x (oo x R) — 2 moo xr' — moo x (oo x r r ). 

Hence, in free fall on the earth—contrary to the inertial system—there appear vir¬ 
tual forces that deflect the body in the x'- and y'■ -directions. 

If only gravity acts, the force F in the inertial system is F = — yMmr/r 3 . By inser¬ 
tion we obtain 

Mm 

mr f = —y ——r — moo x (oo x R) — 2 moo x r f — moo x (oo x r'). 
r i 

We now introduce the experimentally determined value for the gravitational accel¬ 
eration g: 


g = -y—R-w x (oo x R). 

Here we have inserted in the gravitational acceleration — yMr/r 3 the radius 
r = R + r f and kept the approximation r ~ R, which is reasonable near the earth’s 
surface. The second term is the centripetal acceleration due to the earth’s rotation, 
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2 Free Fall on the Rotating Earth 


Fig. 2.1. Octant of the globe: 
Position of the various coordi¬ 
nate systems 



which leads to a decrease of the gravitational acceleration (as a function of the geo¬ 
graphical latitude). The reduction is included in the experimental value for g. We thus 
obtain 

mr' = mg — 2 moo x r' — moo x (co x r'). 

In the vicinity of the earth’s surface (r' R) the last term can be neglected, since 
co 2 enters and \co\ is small compared to 1/s. Thus the equation simplifies to 

r / = g- 2 (wxr / ) or r' =—ge' 3 — 2(co x r'). (2.2) 

The vector equation is solved by decomposing it into its components. First one suit¬ 
ably evaluates the vector product. From Fig. 2.1 one obtains, with ei, e2, e3 the unit 
vectors of the inertial system and e[,e f 2 , e 3 the unit vectors of the moving system, the 
following relation: 

e 3 = (e 3 • ei)ei + (e 3 • e 2 )e 2 + (e 3 • e 3 )e 3 
= (— sinA.)e / 1 + Oe^ + (cos A.)e 3 . 

Because co = coe 3, one gets the component representation of co in the moving system: 
co = —a; sin + co cos A,e 3 . 

Then for the vector product we get 

co x r f = (—ar/cosA^ej + (^msinA + i^cos A,)e2 — (coy' sinA.)e 3 . 

The vector equation (2.2) can now be decomposed into the following three compo¬ 
nent equations: 

x = 2y'co cos X , 

y' = — 2co(z sin A. + i'cosA,), (2.3) 

Z — —g + 2ory / sinA.. 

This is a system of three coupled differential equations with co as the coupling parame¬ 
ter. For co = 0, we get the free fall in an inertial system. The solution of such a system 







2.1 Perturbation Calculation 
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can also be obtained in an analytical way. It is, however, useful to learn various ap¬ 
proximation methods from this example. We will first outline these methods and then 
work out the exact analytical solution and compare it with the approximations. 

In the present case, the perturbation calculation and the method of successive ap¬ 
proximation offer themselves as approximations. Both of these methods will be pre¬ 
sented here. The primes on the coordinates will be omitted below. 


2.1 Perturbation Calculation 

Here one starts from a system that is mathematically more tractable, and then one 
accounts for the forces due to the perturbation which are small compared to the re¬ 
maining forces of the system. 

We first integrate (2.3): 

x = 2coy cos A. + c i, 

y = — 2co(x cos A. + z sin A.) + C 2 , (2.4) 

z = —gt + 2coy sinA, + C 3 . 

In free fall on the earth the body is released from the height h at time t = 0; i.e., for 
our problem, the initial conditions are 

z(0) = h, z(0) = 0, 

y(0) = 0, y(0) = 0, 

*(0) = 0, i(0) = 0. 

From this we get the integration constants 
ci=0, C 2 — 2coh sin A,, C 3 = 0, 
and obtain 

x = 2coy cos X , 

y = —2co(x cos A. + (z — h) sin A.), (2.5) 

z = —gt + 2 coy sin A.. 

The terms proportional to co are small compared to the term gt. They represent the 
perturbation. The deviation y from the origin of the moving system is a function of 
co and t; i.e., in the first approximation the term y\(co, t) ~ co appears. Inserting this 
into the first differential equation, we find an expression involving co 2 . Because of the 
consistency in co we can neglect all terms with co 2 , i.e., we obtain to first order in co 


*(0 = 0 , z(t) = -gt , 

and after integration with the initial conditions we get 
x(t) = 0 , z(t) = -|t 2 + /z. 
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2 Free Fall on the Rotating Earth 


Because x(t) = 0, in this approximation the term 2&uccosA drops out from the 
second differential equation (2.5); there remains 

y = —2 co(z — h) sin A. 

Inserting z leads to 



= cogt 2 sin A. 

Integration with the initial condition yields 
cog sin A a 

y = —z—r. 


The solutions of the system of differential equations in the approximation co n = 0 


with n> 2 (i.e., consistent up to linear terms in co) thus read 
x(t) = 0, 

cog sin A o 

yW = ^3-r 3 , 

z(t) = h- 1 1 2 . 

The fall time T is obtained from z(t = T)= 0: 



From this one finds the eastward deflection (e' 2 points east) as a function of the fall 
height: 

cog sin A 2 h 
y(t = T) = y(h) = ———- 



3g 



2.2 Method of Successive Approximation 

If one starts from the known system (2.5) of coupled differential equations, these equa¬ 
tions can be transformed by integration to integral equations: 


t 


x(0 = 2o;cosA J y(u)du + c\, 
0 



0 


0 



0 
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Taking into account the initial conditions 


*(0) = 0, i(0) = 0, 

y(0) = 0, y(0) = 0, 

z(0) = h, z(0)=0, 

the integration constants are 

C\— 0, C‘2 = 0, C 3 = h. 


The iteration method is based on replacing the functions x(u), y (u), z(u) under 
the integral sign by appropriate initial functions. In the first approximation, one de¬ 
termines the functions x(t), y(t),z(t) and then inserts them as x(u), y(u), z(u) on the 
right-hand side to get the second approximation. In general there results a successive 
approximation to the exact solution if co • t = 2nt / T (T = 24 hours) is sufficiently 
small. 

By setting x(u), y(u), z(u) to zero in the above example in the zero-order approxi¬ 
mation, one obtains in the first approximation 

x (1) (f) = 0, 
v (l, (r) = 2 coht sin A, 
z (1) 0) = h- 1 1 2 . 

To check the consistency of these solutions up to terms linear in co , we have to check 
only the second approximation. If there is consistency, there must not appear terms 
that involve co linearly: 

t t 

x^ 2 \t) = 2cocosX J y^\u) du = 2cocosX J 2coh(sinX)u du 
0 0 
t 2 

= 4 co 2 h cos X sin X — = / (co 2 ) ~ 0. 

Like x^\t), z^ l \t) is consistent to first order in co: 


z KA) (t) = h — -gt z + 2cosmX / y^\u)du 


So 


= h ——t -\-2cosinX / 2coh(sinX)u du 


L 

/ 

0 

t 

I 


■ h — —t 2 + i(co 2 ). 
2 
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On the contrary, is not consistent in co , since 


i i 

y( 2 \t) = 2coht sin A, — 2&>cosA J x^\u) du — 2a; sin A J z^ l \u)du 


= 2coh sin A • t — 2coh sin A • t + go;(sinA)- 


t 2 

= geo (sink) — y^2coh (sink) t k(co ). 


We see that in this second step the terms linear in co once again changed greatly. The 
term 2coht sin A obtained in the first iteration step cancels completely and is finally 
replaced by geo (sin A) t 3 /3. A check of y^(0 shows that y^ 2 \t) is consistent up to 
first order in co. 

Just as in the perturbation method discussed above, we get up to first order in co the 
solution 


x(t) = 0, 

geo sin A o 


z(t) = h- 1 1 2 . 


We have of course noted long ago that the method of successive approximation (itera¬ 
tion) is equivalent to the perturbation calculation and basically represents its concep¬ 
tually clean formulation. 


2.3 Exact Solution 

The equations of motion (2.3) can also be solved exactly. For that purpose, we start 


again from 

x = 2&>cosAy, (2.3a) 

y = — 2&>(sinAz + cos Ai), (2.3b) 

z = —g 2co sin Ay. (2.3c) 

By integrating (2.3a) to (2.3c) with the above initial conditions, one gets 

x = 2co cos Ay, (2.5a) 

y = —2co (sin A z + cos Ax) + 2 co sin A/z, (2.5b) 

z = — gt + 2a; sin Ay. (2.5c) 

Insertion of (2.5a) and (2.5c) into (2.3b) yields 

y + 4 co 2 y = 2 cog sin A t = ct. (2.6) 


The general solution of (2.6) is the general solution of the homogeneous equation and 
one particular solution of the inhomogeneous equation, i.e., 

c 

y = —wt + A sin 2 cot + B cos 2 cot. 

4 co 1 
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The initial conditions at the time t = 0 are x = y = 0, z = h, and x = y = z = 0. It 
follows that B = 0 and 2 coA = — c/4co 2 , i.e., A = — c/8cd 3 and therefore 


c c c / sin2<z>A 

y = —^ sin 2^ = -—^ I t — 


4 co 2 8 co 3 


4 co 2 


2 co 


' 


i.e., 


y = - 


g sin A / sin loot 


loo 


loo 


Insertion of (2.7) into (2.5a) yields 
sin loot \ 


x = g sin A cos AI t — 


loo 


- ■ 


From the initial conditions, it follows that 

ft 2 1— cos loo A 

x = g sin A cos A-»- . 

V 2 4 co 2 J 

Equation (2.7) inserted into (2.5c) yields 


z = —gt + 2<z>sinA 


g sin A / sin loot 


loo 


t - 


loo 


2 / sin 2a; t\ 

z = —gt + g sin A( £- 2o)—)’ 


and integration with the initial conditions yields 


z = ~^t 2 + gsin 2 A( — 


1 — cos 2 cot 
4 oo 2 


+ h. 


Summarizing, one finally has 


( 2 . 1 ) 


( 2 . 8 ) 


(2.9) 


x = g sin A cos A 
g sin A 


y = 


loo 


t - 


t 2 1 — cos 2 cot 
2 4m 2 

sin loot N 


loo 


8+2 


z — h — -t + gsin A — - 


1 — cos loot 
4 co 2 


( 2 . 10 ) 


Since cot = 27rfall time/1 day, i.e., very small (cot <$C 1), one can expand (2.10): 
gt 2 2 

v = —sinAcosA(&tf) , 

6 


gt 

y = -y sinA(<z>A 


( 2 . 11 ) 


z = h-—[i- 


gtu, ,2 

—cot 


3 
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If one considers only terms of first order in cot , then {cot) 2 ~ 0, and (2.11) becomes 


x{t) = 0, 

got 3 sin A. 

y(t) = —-- 


z(t) = h- 1 1 2 . 


( 2 . 12 ) 


This is identical with the results obtained by means of perturbation theory. However, 
(2.10) is exactl 

The eastward deflection of a falling mass seems at first paradoxical, since the earth 
rotates toward the east too. However, it becomes transparent if one considers that the 
mass in the height h at the time t — 0 in the inertial system has a larger velocity 
component toward the east (due to the earth rotation) than an observer on the earth’s 
surface. It is just this “excessive” velocity toward the east which for an observer on the 
earth lets the stone fall toward the east, but not _L downward. For the throw upward 
the situation is the opposite (see Exercise 2.2). 


Fig. 2.2. Cut through the earth 
in the equatorial plane viewed 
from the North Pole: M is the 
earth center, and co the angular 
velocity 



EXAMPLE 

2.1 Eastward Deflection of a Falling Body 

As an example, we calculate the eastward deflection of a body that falls at the equator 
from a height of 400 m. 

The eastward deflection of a body falling from the height h is given by 
2 co sin Xh flit 

vw = —5—y-. 

The height h = 400 m, the angular velocity of the earth co = 7.27 • 10 -5 rads -1 , 
and the gravitational acceleration is known. 

Inserting the values in y(h) yields 

2-7.27 • 400 radm / 2-400 s 2 
y{k)= 3 • 10 5 s V 9.81 ’ 
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where rad is a dimensionless quantity. The result is 


Example 2.1 


y(h) = 17.6cm. 

Thus, the body will be deflected toward the east by 17.6 cm. 


EXERCISE 

2.2 Eastward Deflection of a Thrown Body 

Problem. An object will be thrown upward with the initial velocity i>o. Find the 
eastward deflection. 

Solution. If we put the coordinate system at the starting point of the motion, the 
initial conditions read 

z(t = 0) = 0, z(t = 0) = vo, 

y(t = 0) = 0, y(t = 0) = 0, 

x(7 = 0) = 0, x(t = ()) = (). 

The deflection to the east is given by y, the deflection to the south by x; z / 0 denotes 
the height h above the earth’s surface. 

For the motion in y-direction we have, as has been shown (see (2.4)), 

dy 

— = —2co(x cos A. + zsinA.) + C 2 . 

dt 

The motion of the body in x-direction can be neglected; x ~ 0. If one further ne¬ 
glects the influence of the eastward deflection on z, one immediately arrives at the 
equation 


z = -|^ 2 + vo t, 


which is already known from the treatment of the free fall without accounting for the 
earth’s rotation. Insertion into the above differential equation yields 




At the turning point (after the time of ascent T = vq/ g), the deflection is 



It points toward the west, as expected. 
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2 Free Fall on the Rotating Earth 


EXERCISE 

2.3 Superelevation of a River Bank 

Problem. A river of width D flows on the northern hemisphere at the geographical 
latitude cp toward the north with a flow velocity no. By which amount is the right bank 
higher than the left one? 

Evaluate the numerical example D = 2 km, no = 5 km/h, and cp = 45°. 

Solution. For the earth, we have 
d 2 r 

m -^2 — —mge 3 — 2 mco x v with co = —co sin + co cos Xe ' 3 . 

The flow velocity is v = — noe^, and hence, 
co x v = —cov 0 sin cpe' 2 . 

Then the force is 

mr = F = —mge 3 + 2mm no sin^e^ = F^e' 3 + /^e^. 

F must be perpendicular to the water surface (see Fig. 2.3). With the magnitude of 


the force 


F = y 4 m 2 co 2 VQ sin 2 cp + m 2 g 2 

one can, from Fig 2.3, determine H = D sin a and sin a = F 2 /F. For the desired 
height H one obtains 


2covq sirup 2 Dcovq sirup 



Fig. 2.3. 


c_ 3 CO 



H 


For the numerical example one gets a bank superelevation of H ~ 2.9 cm. 


EXERCISE 


2.4 Difference of Sea Depth at the Pole and Equator 


Problem. Fet a uniform spherical earth be covered by water. The sea surface takes 
the shape of an oblate spheroid if the earth rotates with the angular velocity co. 
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Find an expression that approximately describes the difference of the sea depth at 
the pole and equator, respectively. Assume that the sea surface is a surface of con¬ 
stant potential energy. Neglect the corrections to the gravitational potential due to the 
deformation. 

Solution. 


FeffOO = - 


ymM 2 

— -z — e r +mco rsin^e*, 
r z 


r =r • sin #, 



7*2 

V| n=~j FeffV) ■ dr 

7*1 


7*2 

7 (- 

7*1 

ymM 


ymM 2 i 

-e r + mco rsmifre x ]dr-e r 


ri mco 2 r 2 sin 2 & 
n 2 


We therefore define 


Veff(r) = 


ymM 

r 


2 2 
mco r 

2 


sin 2 


(2.13) 


Let 


r = R + Ar($); Ar($) R. 


The potential at the surface of the rotating sphere is constant by definition: 


V(r) = 


ymM 

R 


+ Vo- 


According to the formulation of the problem, the earth’s surface is an equipoten- 
tial surface. From this it follows that the attractive force acts normal to this surface. 
Because of the constancy of the potential along the surface, no tangential force can 
arise. 


V(r) 


= _ra£L( 


Ar t 7 
— sin & 
R 


ymM 

R— 


+ F 0 . 


From this it follows that 


V 0 : 


ymM 

R 2 


m 


Ar - -co 2 R 2 sin z & — mco z RAr sin z 


• 2 , 


As can be seen by inserting the given values, the last term can be neglected: 
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2 Free Fall on the Rotating Earth 


Exercise 2.4 


From this it follows that 


ymM 


m 


Ar = Vo + —co 2 R 2 sin 2 ft. 


R 2 2 

or explicitly for the difference A r(ft), 

R 2 


Ar(ft) = 


V 0 + -« 2 fl 2 sin 2 I. 


ymM 


(2.14) 


The second requirement for the evaluation of the deformation is the volume conserva¬ 
tion. Since one can assume Ar R, we can write this requirement as a simple surface 
integral 


tt/2 27T 

I I da • Ar($) = 0, 

tf=0<p=0 


and hence, because of the rotational symmetry in (p , 

7T/2 


/( 

0 


mco 2 R 2 sin 2 ft \ 

Vq H- )2jtR R smftdft = 0, 


from which follows 

jt/2 


/( 

0 

With 


mco 2 R 2 sin 3 ft 


Vo sin ft + 


dft = 0. 


7T/2 


sinz^J^ = 1 and 


o 

one gets 


tt/2 

/ 


sim ftdft = 


m 


V 0 + — co 2 R 2 = 0, 


m 99 
V 0 = --co 2 R 2 . 


By inserting this result into (2.14), one obtains 

R 4 CO 1 ( 2 2\ 

A rift') =-1 sin 2 ft -I. 


(2.15) 


yM 2 

In the last step yM/R 2 is to be replaced by g; thus we have found an approximate 
expression for the difference of the sea depth: 


co 2 R 2 


sim ft — - 


A r(ft) = 


2 g 


(2.16) 








2.3 Exact Solution 
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By inserting the given values 

R = 6370km, g = 9.81^, to = —= 7.2722 • 1(T 5 
s 1 T s 

we get 

d = Ar — Ar(0) ~ 10.94 km. 

If one wants to include the influence of the deformation on the gravitational po¬ 
tential, one needs the so-called spherical surface harmonics. They will be outlined in 
detail in the lectures on classical electrodynamics. 1 


Exercise 2.4 


1 


See W. Greiner: Classical Electrodynamics , 1st ed., Springer, Berlin (1998). 





Foucault's Pendulum 


3 


In 1851, Foucault 1 found a simple and convincing proof of the earth rotation: A pen¬ 
dulum tends to maintain its plane of motion, independent of any rotation of the sus¬ 
pension point. If such a rotation is nevertheless observed in a laboratory, one can only 
conclude that the laboratory (i.e., the earth) rotates. 

Figure 3.1 shows the arrangement of the pendulum and fixes the axes of the coor¬ 
dinate system. 



Fig. 3.1. Principle of the Fou¬ 
cault pendulum 


We first derive the equation of motion of the Foucault pendulum. For the mass point 
we have 

F = T + mg, (3.1) 

where T is a still unknown tension force along the pendulum string. In the basic equa¬ 
tion that holds for moving reference frames, 

dco 

mr = F — m — x r — 2 moo x v — moo x (oo x r), (3.2) 

dt 


1 Jean Bernard Leon Foucault [fuk’o], French physicist, b. Sept. 18, 1819 Paris-d. Feb. 11, 1868. 
In 1851, Foucault performed his famous pendulum experiment in the Pantheon in Paris as a proof of 
the earth’s rotation. In the same year he proved by means of a rotating mirror that light propagates 
in water more slowly than in air, which was important for confirming the wave theory of light. He 
investigated the eddy currents in metals detected by D.F. Arago (Foucault-currents), and also studied 
light and heat radiation together with A.H.L. Fizeau. 


W. Greiner, Classical Mechanics , 

DOI 10.1007/978-3-642-03434-3_3, © Springer-Verlag Berlin Heidelberg 2010 


23 









24 


3 Foucault's Pendulum 


the linear forces and the centripetal forces can be neglected, because for the earth’s ro¬ 
tation dco/dt = 0 and t • \co\ <£ 1, t 2 co 2 ~ 0 (t & pendulum period). By inserting (3.1) 
into the simplified equation (3.2), we get 

mr = T + mg — 2mm x v. (3.3) 

As is obvious from this equation, the earth’s rotation is expressed for the moving 
observer by the appearance of a virtual force, the Coriolis force. The Coriolis force 
causes a rotation of the vibrational plane of the pendulum. The string tension T can 
be determined from (3.3) by noting that 


T = (T • ei)ei + (T • «£)<£ + (T • e 3 )e' = Tj 

_ T (-x,-y,l-z) 

V * 2 + y 2 + (l - z ) 2 
_ T (-x, -y,l-z) 
l 


(3.4) 


In the last step, we presupposed a very large pendulum length /, so that x/l 1, 
y / Z 1, and z/1 1. Evaluation of the scalar product therefore yields 


T = T 


x , y , z l , 


(3.5) 


Before inserting (3.5) into (3.3), it is practical to decompose (3.3) into individual com¬ 
ponents. For this purpose one has to evaluate the vector product co x v: 


(O x v = 


—co sin X 
x 


0 

y 


&>cos 


k 


= —cucosA.ye^ +&>(cosAi; + sinA.z)e 2 — co sin Xye' 3 . 


(3.6) 


By inserting (3.5) and (3.6) into (3.3) with mg = — mge' 3 , we obtain a coupled 
system of differential equations: 


mx = — — T + 2mm cos Ay, 


my = — — T — 2mm (cos kx + sinAz), 

l — z 

mz = —-—T — mg + 2mm sin Ay. 


(3.7) 


To eliminate the unknown string tension T from the system (3.7), we adopt the already 
mentioned approximations: 
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Fig. 3.2. Projection of the 
string tension T onto the 
axes ej 


The pendulum string shall be very long, but the pendulum shall oscillate with small 
amplitudes only. From this it follows that x/Z <£ 1, y/l <£ 1, and z/1 1, since the 

mass point moves almost in the x, y-plane. Hence, for calculating the string tension 
we use the approximation 

—J— = 1’ mi — 0, (3.8) 

and obtain from the third equation (3.7) 

T = mg — 2mco sin Ay. (3.9) 


Insertion of (3.9) and (3.7) after division by the mass m yields 
g 2co sin A 


x ~~i x + 


l 


-xy + 2&>cosAy, 


g 2&>sinA . 

y = — —y H--- yy — 2<x>cos Ax. 


(3.10) 


Equation (3.10) represents a system of nonlinear, coupled differential equations; non¬ 
linear since the mixed terms xy and yy appear. Since the products of the small num¬ 
bers co, x, and y (or co, y, and y) are negligible compared to the other terms, (3.10) 
can be considered equivalent to 


x = — —x + 2&>cos Ay, 


y = — —y — 2co cosAx. 


(3.11) 


These two linear (but coupled) differential equations describe the vibrations of a pen¬ 
dulum under the influence of the Coriolis force to a good approximation. In the fol¬ 
lowing we will describe a method of solving (3.11). 
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3 Foucault's Pendulum 


3.1 Solution of the Differential Equations 

For solving (3. 1 1), we introduce the abbreviations g/1 — k 2 and co cos A = a, multiply 
y by the imaginary unit i = V—T, and obtain 

x = — k 2 x — 2 ai 2 y 

iy = —k 2 iy — 2 aix (3.12) 

x + iy = —k (.x + iy) — 2ai(x + iy) 

The abbreviation u = x + iy is obvious: 

u = —ku—2aiu or 0 = w + 2aiu + k u. (3.13) 

Equation (3.13) is solved by the ansatz useful for all vibration processes, 

u = C-e yt , (3.14) 

where y is to be determined by inserting the derivatives into (3.13): 

Cy 2 e yt + 2cdCye y ‘ +k 2 Ce yt = 0 or y 2 + 2iay + k 2 = 0. (3.15) 

The two solutions of (3.15) are 

yi /2 = —ioi =b ikyj 1 + ot 2 /k 2 . (3.16) 

Since a 2 = co 2 cos 2 A because co 2 /k 2 is small compared to 1 (a) 2 /k 2 = T 2 tn J T 2 ^ 
<^ 1, where r penc \ is the pendulum period and T eart h = 1 day), it further follows that 

y \/2 = —iot ± ik. (3.17) 

The most general solution of the differential equation (3.13) is the linear combination 
of the linearly independent solutions 

u = A-e Ylt + B-e Y2t , (3.18) 

where A and B must be fixed by the initial conditions and are of course complex, i.e., 
can be decomposed into a real and an imaginary part: 

u = (Ai + iA 2 )e- i{a - k)t + (B\ + iB 2 )e- ,(a+k)t . (3.19) 

The Euler relation e~ l(p = cos <p — i sin (p allows one to split (3.19) into u = v + iy : 

x + iy = (Ai + /A 2 )[cos(a — k)t — i sin (a — k)t] 

+ (^i + /i? 2 )[cos(a + k)t — i sin (a + k)t], (3.20) 

from which it follows after separating the real and the imaginary parts 

x = A\ cos(a — k)t + A 2 sin(a — k)t + B\ cos(a + k)t + B 2 sin(a + k)t, 

(3.21) 

y = —A 1 sin(a — k)t + A 2 cos(c)' — k)t — B\ sin((y + k)t + B 2 008 ( 0 ? + k)t. 





3.1 Solution of the Differential Equations 
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Let the initial conditions be 

X() = 0 , X() = 0 , 

yo = L, yo = 0, 

i.e., the pendulum is displaced by the distance L toward the east and released at the 

time t = 0 without initial velocity. Inserting xo = 0 in (3.21), one gets 

B\ = —A\. 

Differentiating (3.21) and setting xo = 0 yields 


As already noted in (3.16), a k and thus B 2 ~ A 2 . From (3.21) one now obtains 
x = A\ cos(a — k)t + A 2 sin (a — k)t — A \ cos(a + k)t + A 2 sin (a + k)t, 


(3.22) 


y = —A\ sin(o' — k)t + A 2 cos(a — k)t + A\ sin^ + k)t + A 2 cos^ + k)t. 

We still have to include the initial conditions for yo and yo- From yo = 0 and (3.22) 
we get 

— A\(a — k) + A\(a + k) = 0 => Ai=0. 

From yo = L and (3.22) we get 



By inserting these values one obtains 



Using the trigonometric formulae 


sin(a zb k) = sin a cos k =b cos a sink, cos(a ±k) = cos a cos k =p sin a sink, 


it follows that 


x = L sinc^ coskt, y = Lcosat coskt. 


The two equations can be combined into a vector equation: 


r = L cos kt [sin(otf )®i + cos(oT)e 2 ] • 


(3.23) 
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3 Foucault's Pendulum 


Fig. 3.3. The unit vector n(0 
rotates in the x, y-plane 



3.2 Discussion of the Solution 

The first factor in (3.23) describes the motion of a pendulum that vibrates with the am¬ 
plitude L and the frequency k = +Jg/l . The second term is a unit vector n that rotates 
with the frequency a — co cos A and describes the rotation of the vibration plane: 

r = Lcosktn(t), 
n(0 = sinoTei + cosat e 2 . 

Equation (3.23) also tells us in what direction the vibrational plane rotates. For the 
northern hemisphere cos A. > 0, and after a short time sin at > 0 and cos at > 0, i.e., 
the vibrational plane rotates clockwise. An observer in the southern hemisphere will 
see his pendulum rotate counter-clockwise, since cos A, < 0. 

At the equator the experiment fails, since cos A. = 0. Although the component co x = 
—co sin A. takes its maximum value there, it cannot be demonstrated by means of the 
Foucault pendulum. 

Following the path of the mass point of a Foucault pendulum, one finds rosette 
trajectories. Note that the shape of the trajectories essentially depends on the initial 
conditions (see Fig. 3.4). The left side shows a rosette path for a pendulum released at 
the maximum displacement; the pendulum shown on the right side was pushed out of 
the rest position. 


Fig. 3.4. Rosette paths of the 
Foucault pendulum 




Because of the assumption a k in (3.16), (3.23) does not describe either of the 
two rosettes exactly. According to (3.23), the pendulum always passes the rest posi¬ 
tion, although the initial conditions were adopted as in the left figure. 










3.2 Discussion of the Solution 
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EXERCISE 


3.1 Chain Fixed to a Rotating Bar 


Problem. A vertical bar A B rotates with constant angular velocity co. A light non- 
stretchable chain of length l is fixed at one end to the point O of the bar, while the 
mass m is fixed at its other end. Find the chain tension and the angle between chain 
and bar in the state of equilibrium. 

Solution. Three forces act on the body, viz. 

(1) the gravitation (weight): F g = —mge 3 ; 

(2) the centrifugal force: F z = — mco x (co x r); 

(3) the chain tension force: T — — T sin^ei + T cos ^ 3 . 



Fig. 3.5. e b e 2> e 3 are the unit 
vectors of a rectangular co¬ 
ordinate system rotating with 
the bar; T is the chain tension 
force; F g is the weight of the 
mass m; F z is the centrifugal 
force 


Since the angular velocity has only one component in the e 3 -direction, co = coe 3 , and 
r = /(sin^ei + (1 — cos<p)e 3 ), 
we find for the centrifugal force 
F z = —m(o) x {co x r)) 
the expression 

F z = +mcD 2 l sin <p ei. 

If the body is in equilibrium, the sum of the three forces equals zero: 

0 = —mge 3 + mco 2 1 sin cpe 1 — T sin cpe 1 + T cos cpe 3 . 

When ordering by components, we obtain 

0 = (mco 2 l sin cp — T sin<p)ei + (T cos cp — mg)e 3 . 
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3 Foucault's Pendulum 


Exercise 3 .7 


Since a vector vanishes only if every component equals zero, we can set up the fol¬ 
lowing component equations: 

mco 2 l sin cp — T sin <p = 0, (3.24) 

T cos — mg = 0. (3.25) 

One solution of (3.24) is sin cp = 0. It represents a state of unstable equilibrium that 
happens if the body rotates on the axis A B. In this case the centrifugal force compo¬ 
nent vanishes. A second solution of the system is found by assuming simp / 0. We 
can then divide (3.24) by simp and get the tension force T : 

T = mco 2 l (3.26) 

and after elimination of T from (3.25) we get the angle <p between the chain and the 
bar: 


cos (p = 


CO 2 1 


Since the chain O P with the mass m in P moves on the surface of a cone, this arrange¬ 
ment is called the cone pendulum. 


EXERCISE 


3.2 Pendulum in a Moving Train 


Problem. The period of a pendulum of length l is given by T . How will the period 
change if the pendulum is suspended at the ceiling of a train that moves with the 
velocity v along a curve with radius R1 

(a) Neglect the Coriolis force. Why can you do that? 

(b) Solve the equations of motion (with Coriolis force!) nearly exactly (analogous to 
Foucault’s pendulum). 

Solution, (a) The backdriving force is 
mv 2 

F r = —mg sin <p + cos <p. 

One has v(x) = co(R + x) = co(R + l sin<p) and R f = R + v. Hence, it follows that 
Fr = —mg sin cp + mco 2 (R + / sin cp) cos <p. 

The differential equation for the motion therefore reads 
ms = —mg sin<p + mco 2 (R + l sin<p) cos<p. 




3.2 Discussion of the Solution 
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Since s = lep , s = Up , it follows that Up = — g sin<p + or(/? +1 sin<p) cos<p, or 


8 


l 


R 


ip + — sin cp — co z (-b sin cp I cos (p = 0, 


(3.27) 


For small amplitudes, cos cp ~ 1 and sirup ^(p, i.e., 


8 


<P+ -r<p — o) — + <p 


R 


l 


or 


<P + 


£ l\ 2 ^ n 

7 ~“ r - " y =0 


(3.28) 


Here, the Coriolis force was neglected, since the angular velocity cp and hence x is 
small compared to the rotational velocity t ) = co(R +x), i.e., wxx = 0. The solution 
of the homogeneous differential equation is 


<fi h = s m(yf 

The particular solution of the inhomogeneous differential equation is 

c» 2 (R/l) 

Vl ig/l)-co 2 ' 

The general solution of (3.27) is therefore 
<P = <Ph+<Pi= sin (j j - <w 2 f 

Hence, the vibration period is 
T 27T 

- V(g/o-« 2 ' 

For cd = 'sfgjl the period becomes infinite, since the centrifugal force exceeds the 
gravitational force. This interpretation gets to the core of the matter, although the 
formula (3.28) holds only for small angular velocities: For large angular velocities the 


\ <o 2 (R/l) 

) + {g/l)-co 2 ' 
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3 Foucault's Pendulum 


Exercise 3.2 


approximation of small vibration amplitudes x in (3.27) is no longer allowed, because 


the pendulum mass is being pressed outward due to the centrifugal force, i.e., to large 
values of x. 

(b) The equations of motion read 
dco 

mr = F — rn — x r — 2 moo x v — moo x (co x (r + R)). 
dt 

With 

co — —eve-, R = Re x , —2mcoxx = 2mco(—y,x,0), 
and 

—mco x (co x (r + R)) = mco 2 (R + x, y, 0), 
one finds 

mx = — T x — 2m coy + mco 2 (R + x) = — — T — 2mcoy + mco 2 (R + x), 

my = —T y + 2m cox + mco 2 y = — y T + 2m cox + mco 2 y, (3.29) 

z 

mi — —T z + mg = — -T + mg. 

In the following, we will assume that the pendulum length is large, i.e., for small 
amplitudes z ~ l (2 = 2 = 0). The string tension is then given by T = mg: 




With the substitution u = x + iy, it follows further that 



(3.30) 


For the homogeneous solution of the differential equation (3.30) one gets with the 
ansatz Whom = ce yt the characteristic polynomial 



The homogeneous solution then takes the form 

Whom = Cl exp(; (co + y/g/J)t) + c 2 exp (/ (co - yfgfl )/). 


The particular solution is simply obtained as 





3.2 Discussion of the Solution 
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From this it follows that 


U — Whom Wpart 

= c\ exp(/ (co + yfgjl)t) + C 2 exp (i[co — yfgfl )0 + 


co z R 


C 8/D-co 2 ' 


(3.31) 


With the initial condition x(0) = xo and y(0) = x(0) = y( 0) = 0, it follows for c\ 
and C 2 


_ Vgfl -CO f '" 2 

c l — “t:— i — rr I x o + 


co z R 


2VgTi 1 
\fgjl + co , 


co 2 - g/l)' 


co z R 


c2 'um ( X0+ o t-g/i 


(3.32) 


By decomposing the solution (3.31) into real and imaginary parts, the solutions for 
x(t) and y(t) can be found. 


. IS \ I [g\ 0)1 R 

X = Cl COS ( CO + J- Jt + C2COSI CO — ^ y Jt + — - ^ 




c ° 2r \l [s [s, , , . [g, . J 

co 2 - g /i)wi V/ yi 


+ ■ 


co 2 R 


g/l-0) 2 

= ^*0 + 


co 2 R 


0 . cos « —t cos cot + co — sin —t sin cot \ + 

v 2 -g/ij i v / x g y i 


co 2 R 


g/l-O) 2 ’ 

(3.33) 


• / , g \. , ■ g \, 

y = c\ sin I co + J — J t + C 2 sin I co — J — 1 1 


— + 


(O 2 R \ I . [g / 1 . [g 

—«- 1 smcotcos. -t — co I - sin -t coscot 

co z ~g/lJ Ml Mg Ml I 


(3.34) 


Because s/g/L co^JTJg ^ 1. From this it follows that 


x = xqcos — t cos cot , 


y = xqcos a / - t smcot. 


Exercise 3.2 


This describes a rotation of the pendulum plane with the frequency co (as for Foucault’s 
pendulum). 
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3 Foucault's Pendulum 


The pendulum period T can now be obtained from the following consideration: For 
t — 0, the brace in (3.33) equals 1. For t = (n/2)*JTJg + t\ where t' (jt/2)y/l/g, 
the brace vanishes for the first time, which corresponds to a quarter of T. By expanding 
the brace, for t' we find 





On the other hand, in part (a) we found 



which suggests the conclusion that the Coriolis force should not be neglected from the 
outset in this consideration. 


EXERCISE 

3.3 Formation of Cyclones 

Problem. Explain to which directions the winds from north, east, south, and west 
will be deflected in the northern hemisphere. Explain the formation of cyclones. 

Solution. We derive the equation of motion for a parcel of air P that moves near the 
earth’s surface. The X, Y, Z system is considered as an inertial system; i.e., we shall 
not take the rotation of the earth about the sun into account. Moreover, we assume the 
air mass is moving at constant height; i.e., there is no velocity component along the 
z-direction (z = 0). The centrifugal acceleration shall also be neglected. 

With the assumptions mentioned above, the equation of motion of the particle is 
defined by the differential equation 

r = g — 2(oo x r) = g — 2(co± x r) — 2(co\\ x r), 

where co x r = (co± + wy) x r. Let co\\ be the component of co within the tangential 
plane at the point Q of the earth’s surface (see Fig 3.7); it points to the negative 
ei-direction. co± points to the e 3 -direction. 

We consider the dominant term — 2co± x r: An air parcel moving in v-direction 
(south) is accelerated toward the negative y-axis, and a motion along the y-direction 
causes an acceleration in v-direction. The deflection proceeds from the direction of 
motion toward the right side. The wind from the west is deflected toward the south, 
the north wind toward the west, the east wind toward the north, and the south wind 
toward the east. 






3.2 Discussion of the Solution 
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Fig. 3.7. Definition of the 
coordinates: 0 = origin of 
the inertial systems X,Y, Z; 
Q = origin of the moving 
system x,y,z‘, P =a point 
with mass m; yo = position 
vector in the system X,Y,Z\ 
and r = position vector in the 
x,y, z-system 



anticyclone in the northern hemisphere 


Fig. 3.8. If air flows in the 
northern hemisphere from a 
high-pressure region to a low- 
pressure region, a left-rotating 
cyclone arises in the low- 
pressure region, and a right- 
rotating anticyclone is formed 
cyclone in the high-pressure region 


The force — 2m(0\\ x r for the north and south winds exactly equals zero. For the 
west or east winds the force points along e 3 or the opposite direction. Accordingly 
the air masses are pushed away from or toward the ground. This force component is 
however very small compared to the gravitational force mg, which also points toward 
the negative e 3 -direction. 

If we consider an air parcel moving in the southern hemisphere, then X> n/2 and 
cos X is negative. Thus, a west wind is here deflected to the north, a north wind toward 
the east, and a south wind toward the west. 


EXERCISE 


3.4 Movable Mass in a Rotating Tube 


Problem. A tube rotates with constant angular velocity co (relative motion) and is 
inclined from the rotational axis by the angle a. A mass m inside of the tube is pulled 
inward with constant velocity c by a string. 

(a) What forces act on the mass? 

(b) What work is performed by these forces while the mass moves from x\ to * 2 ? 
(Calculate the energy balance!) Numerical values: m = 5 kg, a = 45°, x\ — lm, 
X 2 = 5m, co = 2s~ l , c = 5m/s, g = 9.81 m/s 2 . 

Solution, (a) The mass m within the tube performs a relative motion with constant 
velocity c = c(— 1 , 0 , 0), and thus the resulting acceleration is composed of the guid- 
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Exercise 3.4 


Fig. 3.9. 


ing acceleration a/ in the tube, the relative acceleration a r , and the Coriolis accelera¬ 
tion a c : 

a = a/ + a r + a c . 

The guiding acceleration consists in general of a translational acceleration bo of the 
vehicle, and of the accelerations a t (tangential acceleration) and a n (normal accelera¬ 
tion) due to a rotational motion. 




In the present problem, we are dealing with a rotation about a fixed axis = 
(cos a, 0, sin a) with a constant angular velocity co, so that ao = a r = 0, and the guid¬ 
ing acceleration is therefore 

ay = ao + a r + a n = a n = x sin aco 2 (— sin a?, 0, cos a), 

i.e., the guiding acceleration obviously consists of the centripetal acceleration b n only. 

The relative acceleration a r = 0, since the relative velocity is constant. The Coriolis 
acceleration a c , defined by 

a c = 2 coe^ x c, 

is therefore 

a c = —2 coc sin a (0,1,0). 

Therefore, for the total acceleration a we have 

a = ay + a c = (— xco 2 sin 2 a, — 2&>csina, xco 2 sina cos a). (3.35) 

a is the result of the following forces acting on the mass m (see Fig. 3.9): 

S = S(— 1, 0, 0), G = mg(—cosa, 0 — sincy), 

Ni = Afi(0,0,1), N 2 = N 2 (0, -1,0). 

The resulting total force is therefore 

F = (-S — mgcosa, —N2, N\ — mg sina?). (3.36) 

With Newton’s equation 


F = m • a. 


(3.37) 
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one can determine the unknown quantities S, N\ and N 2 : From (3.35), (3.36) 
and (3.37) it follows that 


(-S — mg cos a, — N 2 , N\ — mg sin a) 

= m(—xco sin a, —2coc sin a, xco sinacoso') 
=> S = m(xco 2 sin 2 a — geos a) 

N\=m (xco 2 sin a cos a + g sin a) 

N 2 — 2comcsina. 


S becomes negative if jc < (g cos a)/(co 2 sin a)’, i.e., the mass m would have to be 
decelerated additionally within the tube if a constant velocity is to be maintained. 

(b) During the motion, work is performed by the string force S, by the gravitational 
force G, and by the Coriolis force N 2 ; Ni is the normal force. The work performed by 
the string force is 


x 2 


- x '2 


w s 


J dw s = - J S(x)dx 

XI 


XI 

m 


= —co 2 sin 2 a (xf — v|) — mg cos a (x\ — X 2 ). 


(3.38) 


The work performed by the weight force is 


Exercise 3.4 


W G = 


X2 

fdW G =„ s «*«(„-«). 


(3.39) 


XI 


The work performed by the Coriolis force, taking into account dx/dt = —c, is 

Wn 2 = J dW^ 2 = — J N 2 ds = — J N 2 X sinadcp 


f dep dt 

/ N 2 X sin a - dx 

J dt dx 


xi 

= —moo 2 sin 2 a(x 2 — x 2 ). (3.40) 

Insertion of the numerical values given in the formulation of the problem yields 

W s = (3.75 - 17.34) Nm=-13.59 Nm, 

= 17.34 Nm, W Nl =7.5Nm. 


To check the results, one uses the fact that the sum of the work performed by the 
external forces must be equal to the difference of the kinetic energies (energy balance) 


AE = W s + W N2 + W G , 
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3 Foucault's Pendulum 


Exercise 3.4 


where 

171 2 2 2 2 771 2 2 2 2 

A E = —(c + x 2 sin ) — — (c + x 1 sin aw ) 

m 9 9 9 9 

= — —cd sin a(x\ — jc 2 ), 
and according to (3.38), (3.39), and (3.40) 

YYl 999 9 

Ws + Wg + Wn 2 = yfiT sin a(xf — jc 2 ) — mg coscy^i — * 2 ) 

2 . 2 2 2 

— mco sin o'(x 1 — x 2 ) + mgcoso'(xi — X 2 ) 

/77 2 2 2 2 

= -*/ sin a(x^ — jc 2 ) • 




Part 

Mechanics of Particle Systems 


So far we have considered only the mechanics of a mass point. We now proceed to 
describe systems of mass points. A particle system is called a continuum if it consists 
of so great a number of mass points that a description of the individual mass points is 
not feasible. On the other hand, a particle system is called discrete if it consists of a 
manageable number of mass points. 

An idealization of a body (continuum) is the rigid body. The notion of a rigid body 
implies that the distances between the individual points of the body are fixed, so that 
these points cannot move relative to each other. If one considers the relative motion of 
the points of a body, one speaks of a deformable medium. 





The number of degrees of freedom / of a system represents the number of coordinates 
that are necessary to describe the motion of the particles of the system. A mass point 
that can freely move in space has 3 translational degrees of freedom: (x, y, z). If there 
are n mass points freely movable in space, this system has 3 n degrees of freedom: 

(■ *i,yi,Zi ), i = 1, 


4.1 Degrees of Freedom of a Rigid Body 

We look for the number of degrees of freedom of a rigid body that can freely move. 
To describe a rigid body in space, one must know 3 noncollinear points of it. Hence, 
one has 9 coordinates: 

ri = (xuyuzi), r 2 = (x 2 , y2,Z2), *3 = O3, y 3 ,Z 3 ), 

which, however, are mutually dependent. Since by definition we are dealing with a 
rigid body, the distances between any two points are constant. One obtains 

(x\ - x 2 ) 2 + (yi - y 2 ) 2 + (zi - Z 2) 2 = C\ = constant, 

(xi - X3) 2 + (yi - y 3 ) 2 + (z\ ~ Z3) 2 — C\ — constant, 

O2 - X3) 2 + (y 2 - y 3 ) 2 + (z 2 ~ Z3) 2 = C 3 1 = constant. 


Three coordinates can be eliminated by means of these 3 equations. The remaining 6 
coordinates represent the 6 degrees of freedom. These are the 3 degrees of freedom of 
translation and the 3 degrees of freedom of rotation. The motion of a rigid body can 
always be understood as a translation of any of its points relative to an inertial system 
and a rotation of the body about this point (Chasles ’ 1 theorem). This is illustrated by 
Fig. 4.1: A ABC AA"B"C ", namely, by translation in A A'B'C' and by rotation 
about the point E' in A A"B"C". 


1 Michael Chasles, French mathematician, b. in Epernon Nov. 15, 1793-d. Dec. 18, 1880, Paris. 
Banker in Chartres; from 1841 to 1851 professor at the Ecole Poly technique; after 1846 professor 
at the Sorbonne in Paris. Chasles is independently of J. Steiner one of the founders of the synthetic 
geometry. His Apergu historique by far surpassed the older representations of the development of 
geometry and stimulated new geometrical research in his age. 


W. Greiner, Classical Mechanics, 

DOI 10.1007/978-3-642-03434-3_4, © Springer-Verlag Berlin Heidelberg 2010 
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4 Degrees of Freedom 


Fig. 4.1. Chasles’ theorem: 
The translation vector de¬ 
pends on the rotation, and vice 
versa 



We now consider the rigid body with one point fixed in space. The motion is com¬ 
pletely described if we know the coordinates of two points 

ri = (x\,y\,z\) and r 2 = (x 2 , yi, Zi) 

and adopt the fixed point as the origin of the coordinate system. Since the body is 
rigid, we have 

x i + yf + z i = constant, x\ + y\ + z 2 = constant, 

{x i - x 2 ) 2 + (>’i - y 2 ) 2 + (zi - z 2 ) 2 = constant. 


^O crrrvN- 

5 


From these 3 equations one can eliminate 3 coordinates, so that the remaining 3 coor¬ 
dinates describe the 3 degrees of freedom of rotation. 

If a particle moves along a given curve in space, the number of degrees of freedom 
is / = 1. The curve can be written in the parametric form 


Fig. 4.2. Example of the para¬ 
metric form: A caterpillar creeps 
on a blade of grass 


x=x(s), y = y(s), Z = z(s), 

i.e., for a given curve the position of the particle is fully determined by specifying one 
parameter value s. 

A deformable medium or a fluid has an infinite number of degrees of freedom (e.g., 
a vibrating string, a flexible bar, a drop of fluid). 



Center of Gravity 
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Definition Let a system consist of n particles with the position vectors r v and the 
masses m v for v = (1,..., n). The center of gravity of this system is defined as point S 
with the position vector r s : 


m\Yi + m 2 r 2 +- V m n r n _ J2v=i m v r v 

mi + m 2 4- Ym n YTv=\ m v 


r s 


1 

M 


n 

^ ^ myfy , 
V — \ 


where M = E”=i m v is the total mass of the system, and 


My s = y^m v r v 

V — \ 

is the mass moment. For systems with uniform mass distribution over a volume V 
with the volume density q , the sum E z m z r z becomes an integral, and one obtains 

_ f v YQ(r)dV 
YS ~ fyQdV 

The individual components are 

Ev m ^v Ey^vJv 

X s = —--, W = - 

M J M 

and for a continuous mass distribution 
_ fyQxdV _ JyQydV 


Zs ■ 


M ’ 


= 


JyQZdV 


M 



Fig. 5.1. Definition of the cen¬ 
ter of gravity 
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5 Center of Gravity 


where the total mass is given by 


m = j 2 


m v 


or M 


-I 


QdV. 


We consider three systems of masses with the centers of gravity ri, r 2 , r 3 and the total 

masses M\ , M 2 , M3 . The system 1 consists of the mass M\ = (m 11 + m 12 + m 13 H-) 

with the position vectors rn, 1 * 12 , 1 * 13 ,the systems 2 and 3 are analogous. Then by 
definition the centers of gravity are 


system 1 : 


system 2 : 


system 3: 


*sl = 

r S 2 = 

*s3 = 


YjjnuIlL 

T,i m u 

E,- W 2i' r 2i 

E/™* 

Ei”*3i r 3i 


For the center of gravity of the total system we have the same relation: 


E, + El w 2fr 2 f + Ya m 3;T3; 

Ei m U + Ei m 2i + Ei w 3i 


_ Mir^i + M 2 r 52 + M 3 r 53 

Mi + M 2 + M 3 

Hence, for composite systems we can determine the centers of gravity and masses of 
the partial systems, and from them calculate the center of gravity of the total system. 
The calculation can thereby be much simplified. This fact is often referred to as the 
cluster property of the center of gravity. 

The linear momentum of a particle system is the sum of the momenta of the indi¬ 
vidual particles: 

n n 

P = 'y ' Py = y ' tflyYy. 
v=l v—\ 

If we introduce the center of gravity by Mr s = JT m ; r ; -, we see that P = M r s , i.e., 
the total momentum of a particle system equals the product of the total mass M united 
in the center of gravity and its velocity r s . This means that the translation of a body 
can be described by the motion of the center of gravity. 


EXERCISE 

5.1 Center of Gravity for a System of Three Mass Points 

Problem. Find the coordinates of the center of gravity for a system of 3 mass points. 

m 1 = 1 g, m 2 = 3 g, m 3 = 10 g. 
ri = (1, 5, 7) cm, r 2 = (—1, 2, 3) cm, r 3 = (0,4, 5) cm. 
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Solution. For the center of gravity, one finds 

r s = -^(1 - 3, 5 + 3 • 2 + 10 • 4, 7 + 3 • 3 + 10 • 5) cm 
or, recalculated, 

r s = —(— 2 , 51, 66 ) cm. 


Exercise 5.1 


EXERCISE 


5.2 Center of Gravity of a Pyramid 


Problem. Find the center of gravity of a pyramid with edge length a and a homoge¬ 
neous mass distribution. 

Solution. Because of the homogeneous mass distribution, the mass density p(r) = 
po = constant. The base of the pyramid is represented by the equation 


x + y + z = a. 


The coordinate axes are the edges, and the origin is the top. Then 


JyPordY 

JvPodV 


SyrdV 

fv dV ’ 


dV = dx dydz. 


The integration limits are evident from Fig. 5.2. The integration runs over z along 
the column from z = 0 to z = a — x — y; over y along the prism from y = 0 to 
y = a — x ; and over x along the pyramid from x = 0 to x = a : 


UrdV 'dzdydx 

fv dV SUSZUS-’dzdyd,' 



X 
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Exercise 5.2 


with 


r = (x,y,z) 


a a—x 



V x=0y =0 


z= a —x—y 

dy dx , 

z=0 


a a—x 

J r dV = J J ^ x(a — x — 


y), y(a — x — y), -(a-x- yf 


0 0 


dy dx. 


The corresponding integration over y and x yields 


/ 


rdV = —(1,1,1), 
24 v ' 


f a 3 

iV - V ~ 6 - 


Thus, the center of gravity is at 


_ f v rdV 
fydV 


a 

4 


( 1 , 1 , 1 ). 


EXERCISE 


5.3 Center of Gravity of a Semicircle 


Problem. Find the center of gravity of a semicircular disk of radius a. The surface 
density is constant. 


Fig. 5.3. 



Solution. The surface density o — constant. (The surface density is defined by 
cr( r) = liniAA^o Am(x, y, z)/AA.) x? and y 5 represent the coordinates of the center 
of gravity. We use polar coordinates for calculating the center of gravity. The equation 
of the semicircle then reads 

r = a, 0 < ip < 7T. 

Because of symmetry, x 5 = 0, and for y s , we have 
f A oy dA f* =0 !?=()('' dr d<p 

y s = -j -- —- 

Ja 


cr dA 


A 


5 Center of Gravity 


47 


The evaluation of the integral yields 

2a 3 /3 4 a 
na 2 /2 3tt ’ 

i.e., the center of gravity lies at r s = (0, 4a 


EXERCISE 

5.4 Center of Gravity of a Circular Cone 


Problem. Determine the center of gravity of 

(a) a homogeneous circular cone with base radius a and height h ; and 

(b) a circular cone as in (a), with a hemisphere of radius a set onto its base. 



Fig. 5.4. 


Solution, (a) Because of symmetry, the center of gravity is on the z-axis, i.e., = 

y 5 = 0. For the z-component, we have 

fk zdv f k zdV 
s f k dV (l/3)7ta 2 h' 

We adopt cylindrical coordinates for evaluating the integral: 


/■ 


2jt a h(l—p/a) 


// / 

p=0g=0 

a 

=2 ’ i\ 


zdV = 
k (p=0 q= 0 z =0 

a 


ZQdgdcpdz 


h ( 1 - QdQ 

a 


■ ith 2 1 - 


n 2 

_ 2 e “ 


^,e_ 4ia 

3 a 4 a 2 


2l2 


a 2 h 


■ TV - 


JO 


12 ’ 


tc h 2 a 2 • 3 1 

\27ta 2 h 4 


Exercise 5.3 
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z 



Fig. 5.5. Because of symme¬ 
try the center of gravity is again 
on the z-axis 


Thus, the center of gravity of a circular cone is independent of the radius of the base, 
(b) See Fig. 5.5. One then has 


= 


fconc zdV + /h 


hemisphere 


zdV j 2 Tth 2 a 2 + f h 


hemisphere 


zdV 


Fcone T ^hemisphere 


(jr/3)(h + 2 a)a 2 


2jt a 


f ziV= fj / gzdcpdgdz 

hemisphere <p =0 £=0 z= _^ a 2_ Q 2 


= tv J ( Q 2 —a 2 )gdQ 

Q=0 

r 4 2 2 


JO 


na 
~~4 ~ 


Hence, the center of gravity is given by 


Zs = 


^7 ra 2 h 2 - \na 4 1 h 2 - 3 a 2 


^(h + 2a) 


4 h + 2a 


y s = o, 


Tv = o. 


EXERCISE 


5.5 Momentary Center and Pole Path 


Problem. 

(a) Show that any positional variation of a rigid disk in the plane can be represented 
by a pure rotation about a point at a finite distance or at infinity. (Hint: The position 
of the disk is already fixed by specifying two points A and B .) 

(b) Show by “differential” variation of position: The planar motion of a rigid disk 
can be described at any moment by a pure rotation about a point varying with the 
motion, the so-called momentary center. The geometric locus of these momentary 
centers is called the pole path or th z fixed pole curve. 

(c) Calculate the fixed pole curve r (<p) for a ladder sliding on two perpendicular walls. 

(d) Calculate the fixed pole curve r(cp) for a bar of length l that can move in the guide 
shown in Fig 5.6. 
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Fig. 5.6. 



Fig. 5.7. 


Solution, (a) For describing the motion of the disk we take the (arbitrary) straight line 
AB \ it turns into the straight line A\B\. The intersection M of the mid-perpendiculars 
onto AA\ and BB\ is the desired center of rotation. 

Argument: The triangles ABM and A\B\M are congruent. Hence the motion can 
be considered as a rotation of the triangle ABM (involving the straight line AB) about 
M by the angle (p. 

(b) For an infinitely small rotation by dcp the same considerations hold. But now 
the individual turning points vary. These are the so-called momentary centers. In a 
differential rotation about a momentary center M, for any point the path element dr 
and the velocity vector v point along the same direction and are perpendicular to the 
connecting lines to M (see Fig. 5.8). The geometric locus of the momentary centers is 
called the pole curve. 



(c) According to (b), one gets Fig. 5.9. The straight line l = AB forms a diagonal 
of the square OBMA. Since the diagonals of a square are equal, M must move along 
a circle of radius l. 
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Fig. 5.9. 



(d) According to (b), one can construct Fig. 5.10. Evidently, 


sinckf = — (1 — sina) and cosa = l —-jj(\ — sirup) 2 , 


AC = l cos a — a cos y, 


- AC cos a 

OM =-= /- a — L 


11 — (a 2 /l 2 )(l — sin<p) 2 

COS 2 (p 


— a. 


cosy cosy V 

Thus, in polar coordinates the equation of the fixed pole curve r(y) reads 


r = r(y) = OM = —a + L 


' 1 — (a 2 /1 2 )( 1 — sin^?) 2 
cos 2 y 



EXAMPLE 

5.6 Scattering in a Central Field 
(1) The problem 

The two-body problem appeared for the first time in recent physics in investigations 
of planetary motion. However, the classical formulation of the two-body problem pro- 































5 Center of Gravity 


51 


vides information both on the bound state as well as on the unbound state (scattering 
state) of a system. 

The study of the unbound states of a system became of great importance in modern 
physics. One learns about the mutual interaction of two objects by scattering them 
off each other and observing the path of the scattered particles as a function of the 
incident energy and of other path parameters. The objects studied in this way are usu¬ 
ally molecules, atoms, atomic nuclei, and elementary particles. Scattering processes 
in these microscopic regions must be described by quantum mechanics. However, one 
can obtain information on scattering processes by means of classical mechanics which 
is confirmed by a quantum mechanical calculation. Moreover, one may learn the meth¬ 
ods for describing scattering phenomena by studying the classical case. 

The schematic arrangement of a scattering experiment is shown in Fig. 5.11. We 
consider a homogeneous beam of incoming particles (projectiles) of the same mass 
and energy. The force acting on a particle is assumed to drop to zero at large distances 
from the scattering center. This guarantees that the interaction is somehow localized. 
Let the initial velocity vo of each projectile relative to the force center be so large that 
the system is in the unbound state, i.e., for t —> oo the distance between the two scat¬ 
tering particles shall become arbitrarily large. For a repulsive potential this happens 
for any value of vq; this does not hold for an attractive potential. 



The interaction of a projectile with the target particle manifests itself by the fact 
that the flight direction after the collision differs from that before the collision (the 
usage of the words “before” and “after” in this context presupposes a more or less 
finite range of the interaction potential). 


(2) Definition of the cross section 

Measured quantities are count rates (number of particles/s in the detector, which is 
assumed to be small). These count rates depend first on the physical data as kind of 
projectile and target, incidence energy, and scattering direction, and second on the 
specific experimental conditions such as detector size, distance between target and 
detector, number of scattering centers, or incident intensity. In order to have a quantity 
that is independent of the latter features, one defines the differential cross section 


do (number of particles scattered to dQ)/s 

- 7 ^: 0 ?, ^ := - it, - r - 

dQ dQ n * I 


(5.1) 


Here, n is the number of scattering centers and I is the beam intensity, which is given 
by (number of projectiles)/(s-m 2 ). The scattering direction is represented here by d 
and ip. d is the angle between the asymptotic scattering direction and the incidence 
direction; it is called the scattering angle, ip is the azimuth angle. dQ denotes the solid- 
angle element covered by the detector. Since we assumed the detector to be small, we 
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Fig. 5.11. Schematic setup of 
a scattering experiment 
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Example 5.6 have 

dQ = sin# dd d(p, (5.2) 

where dd and dcp specify the detector size. Note that (da/dQ)(d, (p) is defined 
by (5.1) and is not a derivative of a quantity a with respect to Q. Obviously 
da/dQ)(d, (p) has the dimension of an area. The standard unit is 

1 b = 1 Bam = 100 (fm ) 2 = 1(T 28 m 2 . (5.3) 


Often the differential cross section will be independent of the azimuth angle (p (we 
shall restrict ourselves to this case), and one can define 


da 

dd 


da 

:= 2n sin# — (#, cp); 
di 2 


(5.4) 


see Fig. 5.12. 


Fig. 5.12. Definition 
cross section 



da 

dd 


(#) = 


(number of particles scattered to dQ)/s 
dd • n •/ 


Finally, we introduce the total cross section, defined by 

2lT IT IT 

[ da C [ da C da 

= j da da { *’ v) = r v l Msin& dU = ) 

0 0 0 


(5.5) 


(5.6) 


It depends only on the kinds of particles, and possibly on the incidence energy: 
(number of scattered particles)/s 

Otot =- 7 -• (5.7) 

n • I 

Like da/dQ it has the dimension of an area. It equals the size of the (Active) area of 
a scattering center which must be traversed perpendicularly by the projectiles in order 
to be deflected at all. 


(3) Introduction of the collision parameter, its relation to the scattering angle, 
and the formula for the differential cross section 

It is clear that the scattering angle d at fixed energy can depend only on the collision 
parameter b , since the initial position and the initial velocity of the particle are then 
specified. The collision parameter is defined as the vertical distance of the asymptotic 
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incidence direction of the projectile from the initial position of the scattered Hence, 
for E = constant the scattering angle is 

d = d(b). (5.8) 

Since the movements in classical mechanics are determined, this connection is unam¬ 
biguous. (This statement is no longer valid in quantum mechanics.) Thus, 

b = b(d), (5.9) 

which means that, by observing an arbitrary particle at a definite scattering angle d , 
one can determine in a straightforward manner the value of the scattering parameter b 
of the incident particle. This fact allows the following consideration. The number dN 
of projectiles per second that move with values b' of the collision parameter 

b <b' <b + db 


toward a scattering center is 


dN = I -iTtbdb 


or 


dN = I • 2itb 


db 

dd 


dd. 


The sign of absolute value stands since the number dN by definition cannot become 
negative. Just this number of particles are scattered into the solid angle element 


dR = 2tt sin d dd. 


By inserting this into (5.5), we get 

da , _ ^ db 
dd y dd 

and for the differential cross section 
db 


da b(d) 

dQ sin# 


dd 


(5.10) 


(5.11) 


This is just the desired relation. The function b(d) is determined by the force law that 
governs the particular case. One realizes that the knowledge of the differential cross 
section allows one to determine the interaction potential between the projectile and 
the target particle. 

In general, the scattering angle will depend not only on the collision parameter but 
also on the incident energy. As a consequence, the differential cross section also be¬ 
comes energy dependent. Hence, one can measure the differential cross section as a 
function of the projectile energy by observing the scattered particles at a fixed scatter¬ 
ing angle. 


(4) Transition to the center-of-mass system, and transformation of the differential 
cross section from the center-of-mass system to the laboratory system 

The considerations of the last section are to some extent independent of the reference 
system. If we move from the laboratory system S to another system S' that moves 
with constant velocity V parallel to the beam axis, the scattering angle and the dif¬ 
ferential cross section (5.5) will change, but the derivation in the last section remains 
unchanged, so that the relation (5.11) remains valid. 


Example 5.6 
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Fig. 5.13. Scattering in the 
laboratory system ( S ) and 
in the center-of-mass system 

(S') 


V 



beam axis 


Fig. 5.14. 




This has a practical meaning inasmuch as cross sections are always measured in 
the laboratory system S where the target is at rest, but the calculation of #(#') often 
simplifies in the center-of mass system S'. We therefore derive a relation between 
these two cross sections. In the following the primed and nonprimed quantities shall 
always refer to these two systems. 

First, we investigate the relation between the scattering angles # and #'. Let \{ 
and V/ be the asymptotic final velocity (/ = final) of the projectile of mass m\ in the 
system S and S', respectively. V is the relative velocity of the two systems. 

From Fig. 5.14, one immediately sees that 

vJ sin ft' sin#' 

tan# = y~c -=- 77 , 

v/ cos ft' + V cos ft' + V/v* 

where V stands for the magnitude of V, and analogously for v'/. Furthermore, 
m\ v\ = (m\ + m 2 )V, 

where v[ is the initial velocity of the projectile in the laboratory system (i = initial), 
and 

v[ = V + v[ i . 

Because m\v ^ = m 2 v’^ andmiiq^ = m 2 V!/ for elastic scattering (E = E^ n ), v[ l = 
v'/, and therefore, 

V m\ 
v/ m 2 ’ 

Hence, 


tan# = 


sin#' 

cos#' + m\/m 2 


(5.12) 


This relation defines the function ft'(ft); we will not give it explicitly. If a projec¬ 
tile in S is scattered into the ring dR with the “radius” ft and the width dft (see 
Fig. 5.12), it will in S' be scattered into a ring dR' with the “radius” #'(#) and the 
width dft' = (dft'/dft)dft. The number of particles scattered to dR in S and to dR' 
in S' is therefore identical, and with (5.5), we get 
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thus, 


or 



da' , dd' 

- (# ) -, 

dd’ dd 


da da' . sin d' dd' 

-(#) =- (d') -. 

dQ dQ' sin# dd 


(5.13) 

(5.14) 


This is the desired connection. 

The difference between the scattering angles and the cross sections, respectively, is 
obviously determined by the mass ratio of projectile and target particle (see (5.12)). 


EXERCISE 


5.7 Rutherford Scattering Cross Section 

Problem. A particle of mass m moves from infinity with the collision parameter b 
toward a force center. The central force is inversely proportional to the square of the 
distance: 


F = kr~ 2 . 

(a) Calculate the scattering angle as a function of b and of the initial velocity of the 
particle. 

(b) What are the differential and the total cross sections? 


Solution, (a) From the discussion of the Kepler problem, we know that the underly¬ 
ing force law has the form 

k 

F = -^. (5.15) 

The minus sign means that the force is attractive. The path equation reads 1 


1 

r 




2 El 1 
mk 2 


cos(0 



(5.16) 


(E = initial energy, l = angular momentum, m = mass of the particle, 0' = integration 
constant). With the standard abbreviation 


/ 2 El 2 

e “V 1 + ^’ 

one can write for (5.16) 

- = ~pr(l + £COS (0 -0 )). 
r l z v 


(5.17) 


(5.18) 


1 See W. Greiner: Classical Mechanics: Point Particles and Relativity, 1st ed., Springer, Berlin 

(2004). 
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Exercise 5.7 


The path is characterized by £: 


s > 1, E > 0: hyperbola, 

£ = 1, £ = 0: parabola, 

£ < 1, E < 0: ellipse, 

mk 2 . 

s = 0, E =-z-: circle. 

2 1 2 

In the given problem the force law is 



(5.19) 


(5.20) 


The force is repulsive. For illustration, we consider the scattering of charged particles 
by a Coulomb field (e.g., atomic nuclei by atomic nuclei, protons by nuclei, or elec¬ 
trons by electrons, etc.). The scattering force center is created by a fixed charge — Ze 
and acts on the particle with the charge —Z’e. The force is then 


F = 


ZZ'e 2 


(5.21) 


If we set k = —ZZ'e 2 , we can directly take over the equations for an attractive poten- 
tial. The path equation (5.18) now reads 


1 mZZ'e 


2 


-(1 + £COS0). 


r l 2 

The coordinates were rotated so that 0 f = 0. For £ (see (5.17)) it follows that 


(5.22) 


f l + 


2 El 2 


y m{ZZ'e 2 ) 2 
Here, we used the relation 


'1 + 


2 Eb 


{ZZ'e 2 ) 2 


l = mbvoo = b'J'lmE, E = -mv 2 


(5.23) 


(5.24) 



Fig. 5.15. Region of 0 for re¬ 
pulsive Coulomb scattering 


between angular momentum (/) and collision parameter ( b ). 

Since £ > 1, (5.22) represents a hyperbola (see (5.19)). Because of the minus sign, 
the values of 0 for the path are restricted to values for which 

cos 0 < — - (5.25) 

£ 

(see Fig. 5.15). Note that the force center for repulsive forces is in the outer focal point 
(see Fig. 5.16). 

The change of 0 that occurs if the particle comes from infinity, and is then scattered 
and moves to infinity again, equals the angle 0 between the asymptotes, which is the 
supplement to the scattering angle 0 (see Fig. 5.16). 
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Fig. 5.16. Illustration of the 
hyperbolic path of a particle 
that is pushed off by a force 
center. The force center lies in 
the outer focal point 


From Fig. 5.15 and (5.25), it follows that 


c ° s (Vi) =sin G) = ‘ 


= cos — = - 


(5.26) 


The relation cos(0/2) = l/e can be proved as follows: The two limiting angles 6 \ and 
62 satisfy the condition 


cos 0 \ = —, 
£ 

1 

cos 62 = —. 

£ 


From this it follows that (see Fig. 5.17): 


sin$i = — sin $ 2 , 



(5.27) 


(5.28) 



Fig. 5.17. The limiting angles 
9\ and 62 have the same co¬ 
sine 


The first of these equations can be rewritten as 


0 /M • ( e 1 1 • a ■ n o 

2 cos — sin — = sin 6 \ = — sm 62 = —2 cos 


and therefore, 


'01 
sin| 2 



(5.29) 


= sm 


O2 

~2 


(5.30) 
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Exercise 5.7 


We look for cos(0/2) = cos(02 — 0i)/2), 4> = 0 2 — 0\\ 


0 


02 01 


= — COS 


cos — = cos-I = cos — cos — I + sin 


(I)' 


01 


0 2 \ . /0i 
— sin — 


\ 2 


+ shT 


From cos 0i = —1/e, it follows that 


1 


— = cos 0i = cos — I — sin I — 


01 


01 


sin 


= cos 


Insertion into (5.31) yields 


0 2 
cos — = — cos 
2 


GO 


+ cos^ 


+ 


1 1 

H— — — . 

£ £ 


From there we find 


1 / 2 Eb 

(0/2) ~ + \ZZ'e 2 


sin 2 (0/2) 




1 


sin 2 (0/2) 

0 

- = arccot 


- 1 


1 - sin 2 (0/2) 


sin 2 (0/2) 


: COt 2 I - 


2 Eb 
ZZ'e 2 


2 Eb 
ZZ'e 2 


(b) From (5.24) and (5.26) it follows that 
b : 


ZZ'e 2 


, e 

COt - 

2 E V 2 


db ZZ'e 2 1 


dd 4 E sin 2 (0/2) 

The differential cross section as a function of 0 is given by 
da b db 


dQ sin 0 dO 
Thus, one obtains 

da (ZZ'e 2 ) 2 


1 


dQ 2Esin0 2E sin 2 (0/2) 

_ 1 fZZ'e 2 \ 2 cot(0/2) 
~ 2\ 2E 


cot 


and with the identity 


sin 0 = 2 sin - cos - 


sin0 sin 2 (0/2) 


(5.31) 

(5.32) 

(5.33) 

(5.34) 


(5.35) 


(5.36) 


(5.37) 


(5.38) 


(5.39) 
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it follows that 


Exercise 5.7 


da 

dQ 


l /ZZ'e 2 \ 2 

4\lT ) 


sin 4 (0/2) 


1 


(5.40) 


This is the well-known Rutherford scattering formula. The total cross section is cal¬ 
culated according to 



(5.41) 


By inserting da/d£2(6) from (5.40), one quickly realizes that the expression diverges 
because of the strong singularity at 0 =0. This is due to the long-range nature of the 
Coulomb force. If one uses potentials which decrease faster than 1/r, this singularity 
disappears. 

EXERCISE 

5.8 Scattering of a Particle by a Spherical Square Well Potential 

Problem. A particle is scattered by a spherical square well potential with radius a 
and depth Uq : 

U — 0 (r > a), 

U = -U 0 ( r<a ). 

Calculate the differential and the total cross section. 

Hint: Use the refraction law for particles at sharp surfaces which results from the 
following consideration: Let the velocity of the particle before scattering by a sharp 
potential well be v\ = Vqq and after scattering V 2 . Due to momentum conservation 
perpendicular to the incident normal (“transverse momentum conservation”) one has 

Voq sin a = i >2 sin f (5.42) 


siny0 Voq' 

From the energy conservation law it follows that 



(5.44) 


Solving for V 2 yields 



(5.45) 


Insertion into (5.43) finally yields 



(5.46) 
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Exercise 5.8 


Solution. The straight path of the particle is broken when entering and leaving the 
field. We have the relation 


sin a 

-= n, 

sin/3 

where according to (5.46) 


(5.47) 


n = J 1 + 


2U 0 


d2 ' 


The deflection angle is (see Fig. 5.18) 


X 

sin a 
sin/I 


: 2(a - P) 
sin (a — x /2) 
sin a 

sin a cos(x /2) — cos a sin(x /2) 


sin a 

= cos ( — | — cot a sin ( — 

2 V 2 


1 

n 


(5.48) 


Fig. 5.18. In the inner and 
outer region of the spheri¬ 
cal potential well the parti¬ 
cle moves along straight lines. 
When passing the surface it 
will be refracted 



From Fig. 5.18, we have 


a sma = q. 

Because sin 2 a + cos 2 a = 1, it follows that 


cos a = J 1 — ( — ) . 


Now we can eliminate a from (5.49): 
cos(x/2) - 1 /n 


sin(x/2) 


= cot O' = 


cos a a cos a 


sin a 


(5.49) 


(5.50) 


(5.51) 
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and with (5.50), we get 

yr - (e/a ) 2 sin(x/ 2 ) 


g = a 


q 2 = 


(cos(x/ 2 ) - 1 /ft) 

a 2 sin 2 (x / 2 ) — £ 2 sin 2 (x / 2 ) 

(cos(x/ 2 ) — 1 /ft ) 2 

a 2 sin 2 (x/ 2 ) 

(cos(x/ 2 ) - 1 /ft ) 2 + sin 2 (x/ 2 ) 1 - ( 2 /ft)cos(x/ 2 ) + 1 /ft 2 


a 2 sin 2 (x/ 2 ) 


2 2 
Q = <2 


ft 2 sin 2 (x/ 2 ) 


ft 2 — 2n cos(x / 2 ) + 1 

To get the cross section, we differentiate 

_ _ n sin(x/ 2 ) _ 

^ a (n 2 — 2 ft cos(x/ 2 ) + l ) 1 / 2 

with respect to x • 

f cos(x/ 2 ) 


(5.52) 


(5.53) 


dg ^-cos(x/ 2 ) 1 ftft sin(x/ 2 ) • ft sin(x/ 2 ) 

dx (n 2 ~ 2 ft cos(x/ 2 ) + l ) 1 / 2 2 (ft 2 — 2 ft cos(x/ 2 ) + l ) 3 / 2 

^ cos ^ (ft 2 + 1 — 2 ft cos j) — \an 2 sin 2 ^ 


(ft 2 + 1 — 2 ft cos 


X \ 3 / 2 


|ft J cos f + ^ cos j — an z cos z sin 

I^ 2 


(ft 2 + 1 — 2ft cos ^-) 
an ft 2 cos ^ + cos ^ — ft — ft cos 2 ^ 
2 (ft 2 + 1 — 2ft cos f ) 3//2 
ftft (ft cos ^ — l) (ft — cos ^-) 


(ft 2 + 1 — 2 ft cos ^-) 


3/2 


(5.54) 


<Kx) = - 


do 


sin X 

, 2„2 


dg 


dx 


Q » sin(x/ 2 ) |(ncos(x/ 2 ) - l)(n - cos(x/ 2 ))| 
2 sinx (« 2 + 1 — 2 «cos(x/ 2)) 2 

1 |(«cos(x/ 2 ) — \){n — cos(x/ 2 ))| 


2 2 
an 


4 cos(x /2) (ft 2 + 1 — 2 ft cos(x /2 )) 2 


Here, we utilized 


0 X . X 
sinx =2cos - sin-. 

2 2 


The angle / takes the values from zero (for g = 0) up to the value Xmax (for 
which is determined by the equation 


(5.55) 

(5.56) 

Q = a) 


Xmax 


1 

ft 


Exercise 5.8 


cos 


2 


(5.57) 
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Exercise 5.8 


The total cross section obtained by integration of (dcr/dQ)(x) over all angles within 
the cone / < Xmax of course equals the geometrical cross section tt a 2 . 

We still want to show that the total cross section for scattering by the spherical 
square well potential equals the geometrical cross section na 2 . This is obvious since 
for r > a we have U = 0, i.e., there is no scattering. 

We start from (5.55) 


da Q 

-777 (x) = -— 
dQ sin/ 


dQ 

dx 


a 2 n 2 1 [n cos(x/2) — 1 ][n — cos(x/2)] 

4 cos(x/2) [n 2 + 1 — 2t7Cos(x/2)] 2 


and integrate over all angles x from 0 to Xmax (^£2 = 27r sinx dx)’ 


Xmax 

f da 

= J dQ.' 


®tot = * dQ ( ' X ^ d ^ 2 = 7Ta / ” sin 2 


Xmax 

/ 


2 . x [«cos(x/2) - l][n -cos(x/2)] 


[« 2 + 1 — 2n cos(x /2)] 2 


dx 


= 7T<2 


Xmax 

/ 


(1 + 77 2 — 2rc cos(x /2)) 2 


x | (ft 2 + 1) cos ^ sin ^ — tt cos 2 ^ sin ^ — n sin ^ [ dx • (5.59) 


1 


11 


hi 


Part III can be integrated at once; I and II are transformed by integrating by parts: 


^tot : 


tv a 2 [n 2 + 1 —2n cos Jp n 2 


Xmax 


— na 2 n{n 2 + 1) 


cos(x/2) 


(1 + n 2 — 2n cos(x /2)) _ 


Xmax 


na 2 n(n 2 + 1) 


Xmax 

f — 

J (1 +n 
0 


(1/2) sin(x/2) 


2 — 2x7 cos(x/2)) 


dx 


2 2 

+ 77 na \ 


2 2 

— nan 


cos 2 (x/2) 


(1 + « 2 — 2 n cos(x /2)) _ 

Xmax 


Xmax 

0 


/ 


— cos(x/2) sin(x/2) 
(1 + n 2 — 2n cos(x /2)) 


dx- 


(5.60) 


In the last integral, we substitute 
X 

y := cos —, 

* 2 

A 1 • X , 

dy = --sin- dx 


(5.61) 
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and obtain 


(7 { q |; - 7 X Cl 


n 2 { 1 + cos 2 (x/2)) — n(n 2 + 1) cos(x/2) 1 Xmax 


n(n 2 + 1) 


(1 + n 2 — 2 n cos(x/2) 

Xmax 


I. 


/ 

o 


sin(x/2) 


(1 + n 2 — 2n cos(x/2)) 


dx 


cos(x max /2) 


— 2n 


/ 


(1 + n 2 — 2 ny) 


dy 


■ Tta 


l 

" 2 (1 + cos 2 (x/2)) - n(n 2 + 1) cos(x/2) 1 


K 


(1 + n 2 — In cos(x /2)) 


“^J 1+ „2_2„ cos | 


I 


Xmax 


Xmax 


0 


' n 2 + 1 \ o 

+ \ ny + I —-— ln(l + n 2 - 2 ny) 


COS(Xma x/2) 
0 


and finally, with Xmax = 2arccos (1 /n), 

2 [ n 2 ( 1 + 1/n 2 ) — n(n 2 + 1)(1 /xz) n 2 (l + 1) — n(n 2 + 1) • 1 


(7 - TC Cl 


(l+rc 2 -2) 


(1 + n 2 — 2 n) 


=0 


: 7T a 


2 [ n — 2 n 2 + n 2 — n 2 + 2n 2 — n + n 2 — 2n + 1 
(n - l ) 2 


: 7ta 2 . 


+ 1 — n\ 


(5.62) 


Exercise 5.8 


EXERCISE 

5.9 Scattering of Two Atoms 

Problem. A hydrogen atom moves along the v-axis with a velocity vu = 1.78 • 
10 2 m • s _1 . It reacts with a chlorine atom that moves perpendicular to the v-axis with 
vci = 3.2 • 10 1 m • s —1 . Calculate the angle and the velocity of the HCl-molecule. The 
atomic weights are H = 1.00797 and Cl = 35.453. 



Fig. 5.19. 
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Exercise 5.9 


Solution. We utilize momentum conservation. The initial momenta are 


Pj = m 11 ;i , m\ = A i • 1 amu, 

(5.63) 

P 2 = m 2 t; 2 e y , m 2 = A 2 • 1 amu. 

Here, Ai,A 2 mean the atomic weights, and 1 amu (“atomic mass unit”) 
= l/12m( 12 C). We require 

P' = P = (mivi, OT 2 U 2 ) with P'= (mi + m 2 )v', (5.64) 


from which we get 
1 


v = 


(m\V\, M2V2) = 


m\ + m2 

Here, \i is the reduced mass. It is calculated as 

m\m2 


/ Vl 

V2 } 

/ Vl 

V2 \ 

— 

— 

1 =fi[ — 

— 

\m 2 

m\) 

r \m2 

m\J 


M = 

m 1 + m 2 
Thus, one obtains 


= 0.9801 amu. 


v' = (4.9208,31.1154) m-s -1 , 
1 / = 31.502 m-s _1 . 


The angle 0 is found from tan# = v’ y /v' x to be 0 = 81.013°. 


(5.65) 


(5.66) 


(5.67) 



6.1 Linear Momentum of the Many-Body System 

If we consider a system of mass points, for the total force acting on the vth particle 
we have 



( 6 . 1 ) 


The force f v x is the force of the particle k on the particle v; F v is the force acting on 
the particle v from the outside of the system; fvA. is the resulting internal force of 
all other particles on the particle v. 

The resulting force acting on the system is obtained by summing over the individual 
forces: 


Ep-E^+EE^=p- 


v v v X 


Since force equals (-) counter force (here Newton’s third law becomes operative), it 
follows that f v i + = 0, so that the terms of the above double sum cancel pairwise. 

One thus obtains for the total force acting on the system 


p=F=y> 


V 


If no external force acts on the system, one has 
F = P = 0, i.e., P = constant. 

The total momentum P = p y of the particle system is thus conserved if the sum of 
the external forces acting on the system vanishes. 


6.2 Angular Momentum of the Many-Body System 

The situation is similar for the angular momentum if the internal forces are assumed 
to be central forces. 

The angular momentum of the vth particle with respect to the coordinate origin is 
ly =r v X p v . 
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6 Mechanical Fundamental Quantities of Systems of Mass Points 



The angular momentum of a single particle is defined with respect to the origin. The 
same holds for the total angular momentum. The angular momentum of the system 
then equals the sum over all individual angular momenta, 

L =y>. 

V 


Analogously, the torque acting on the vth particle is 


d v =r v x F, 


and the total torque is given by 

D=y>. 

V 


The internal forces do not perform a torque, since we assumed them to be central 
forces. This can be seen as follows: For the force acting on the vth particle, according 
to (6.1) we have 

F v +y>, = ^ Pv . 

x 

By vectorial multiplication of the equation from the left by r v , we obtain 

E d d 

Tv x fyA =r v X — p y = — (r„ x p„) =l v . 
dtdt 

The differentiation can be moved to the left, because r y x p v = 0. Summation over v 
yields 

E r » xF »+EE x f v j\_ — L, 
v xv 

D 0 

D=L=y]iy. 

Here - Ev Ex r v x fy^ = 0, since the terms of the double sum cancel pairwise, e.g., 


r v x f vA + r A x f Av = (r„ - r A ) x f vX . 


Since for central forces (r v — r^) is parallel to f v ^, the vector product vanishes. 

The total torque on a system is given by the sum of the external torques 

D = L. 

For D = 0, it follows that L = constant. If no external torques act on a system, the 
total angular momentum is conserved. 
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EXAMPLE 

6.1 Conservation of the Total Angular Momentum of a Many-Body System: Flat¬ 
tening of a Galaxy 





(a) 


(b) 



Fig. 6.2. Formation of a galaxy from a cloud of gas with angular momentum L: (a) The gas con¬ 
tracts due to the mutual gravitational attraction between its constituents, (b) The gas contracts 
faster along the direction of the angular momentum L than in the plane perpendicular to L, since 
the angular momentum must be conserved. In this way a flattening appears, (c) The galaxy in 
equilibrium: In the plane perpendicular to L, the gravitational force balances the centrifugal 
force due to the rotational motion 


rotation axis 



N 





Fig. 6.3. Demonstration of 
angular momentum conserva¬ 
tion in the absence of external 
torques. A person stands on a 
platform that rotates about a 
vertical axis 


(a) 


(b) 


EXAMPLE 


6.2 Conservation of Angular Momentum of a Many-Body Problem: The Pirou¬ 
ette 


(a) The person holds two weights and is set into uniform circular motion with angular 
velocity co. The arms are stretched out, so that the angular momentum is large. 
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Example 6.2 


(b) If the person pulls the arms towards the body, the moment of inertia (see Chap. 11) 
decreases. Since angular momentum is conserved, the angular velocity co signifi¬ 
cantly increases. Skaters exploit this effect when performing a pirouette. 


6.3 Energy Law of the Many-Body System 

Let f v x be the force of the Ath particle on the vth particle. According to (6.1), we have 
^^ d 

Fy + / fyA. — — (/Wv**v)* 

Scalar multiplication of the equation by r v , with 



leads to 


F v -r v + ^]fyA -r v 

x 


d 

dt 


1 

-m 

2 



(l/2)m v r^ is however the kinetic energy T v of the vth particle. By summation over v, 
we obtain 


E F ^ + ££f,A-r v 

v XV 


V N / V V 


J2 V Tv is the time derivative of the total kinetic energy of the system. By integration 
from t\ to t 2 , with 


r v dt = dr v , 


we get 


T(t 2 )-T(ti) = 



• dr v + 


l 2 

?/ 

vX u 


fyA. ' dv v . 


A a 


At 


( 6 . 2 ) 


T is the total kinetic energy, A a is the work performed by external forces, and A; is 
the work performed by internal forces over the time interval t 2 — t\. 

If we assume that the forces can be derived from a potential, we can express the 
performed internal and external work by potential differences. 
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For the external work, we have 

t2 

S/ v V a -dr v = - f dV v a 

v i 

= -YX v v^- y v(f\)\ 

V 

A a = V a (t l )-V a (t 2 ). 



V“ is the potential of the particle v in an external field. By summing over all particles, 
one obtains the total external potential V a = V“. 

The force acting between two particles v and X is assumed to be a central force. 
For the “internal” potential, we set 

VUn v ) = Vi v (r Xv ) = V^(r vk ). 

The mutual potential depends only on the absolute value of the distance: 

r v a = |r„ - r A | = J(x v - x A ) 2 + (y v ~ P) 2 + (z v - PP- 

Thus, the principle of action and reaction is satisfied, since from this it follows auto¬ 
matically that the force f v x is equal and opposite to the counterforce fx v • 

fvx = -V v = +V A V^ = -fxv 


The index v on the gradient indicates that the gradient is to be calculated with respect 
to the components of the position vector r v of the particle v. Hence, 

V -l— — —1 V - f 3 _L _M 
v_ i dx v 'dy v ' dz v y k ~\dxx ’ dyx dzx \ 

Hence, for the internal work we can write 


M — 



P 

A , V 


hv • drx 


We now replace the difference of the position vectors by the vector r v x =r v —rx and 
introduce the operator V v a, which forms the gradient with respect to this difference. 
We get 


^ = -\E f ■ dr »>- = -jHf d P = ~\- yph)), 

z v,A, J Z v,k J Z v,x 


where 
V v a = 


a 


d(x v - xx) a(y v - yx) d(z v ~ zx) 
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Hence, the internal work is the difference of the internal potential energy. This quantity 
is significant for deformable media (deformation energy). 

For rigid bodies where the differences (distances) |r v — r^| are invariant, the inter¬ 
nal work vanishes. Changes dr v x can occur only perpendicular to r v — and hence 
perpendicular to the direction of force, i.e., the scalar products f v x • dr v x vanish. 

If we set for the total potential energy 



for (6.2) we find 


T(t 2 ) ~ T{t i) = V{t i) - V(t 2 ) 


or 


V(ri) + m) = Vte) + r(r 2 ); 


(6.3) 


the sum of potential and kinetic energy for the total system remains conserved. Since 
energy can be transferred by the interaction of the particles (e.g., collisions between 
gas molecules), energy conservation must not hold for the individual particle but must 
hold for all particles together, i.e., for the entire system. 

6.4 Transformation to Center-of-Mass Coordinates 

When investigating the motion of particle systems, one often disregards the common 
translation of the system in space, since only the motions of particles relative to the 
center of gravity of the system are of interest. One therefore transforms the quantities 
characterizing the particles to a system whose origin is the center of gravity. 

According to Fig. 6.4, the origin of the primed coordinate system is the center 
of gravity; the position, velocity, and mass R, V, and M of the center of gravity are 
denoted by capital letters. One has 


r v = R + r y , r v = V + Vy = R + . 


Fig. 6.4. 



According to the definition of the center of gravity, we have 


M • R = ^m v r v = ^m v (R + r / v ), 


V 


V 


M-R=M-R + yV v , 


V 


where M = m v is the total mass of the system. 







6.4 Transformation to Center-of-Mass Coordinates 


71 


From the last equation, it follows that 

( 6 . 4 ) 

y 

Thus, the sum of the mass moments relative to the center of gravity vanishes. If there 
acts a constant external force, as for example the gravity F v = m v g, then it also follows 
that 

D = y>; x F v = (j2m v r'yj xg = 0. 

V ' V ' 

A body in the earth’s field is therefore in equilibrium if it is supported in the center of 
gravity. 

Differentiation of (6.4) with respect to time yields 

T>vv; = 0, (6.5) 

y 

i.e., in the center-of-mass system the sum of the momenta vanishes. In relativistic 
physics this statement is often used as definition of the “center-of-momentum” sys¬ 
tem; there it is not possible to introduce the notion of the center of mass,—as defined 
above—in a consistent way. Only the “center-of-momentum” system can be formu¬ 
lated in a relativistically consistent way. 

The equivalent transformation of the angular momentum leads to 

L = 2>(r„ xvy) = y>y(R + r;) X (V + Vy), 

y y 

L = y^m v ( R + V) + y]m v ( Rx Vy) + y^m v ( r y xV) + ^m„(r' v x v' v ). 

y y y y 


By appropriate grouping, one obtains 

L = M( R x V) + R x (T> v v;) + (y^m v ry j x V + y^m v (r^ x v'„) 

' y 7 ' y 7 y 

and sees that the two middle terms disappear, because of the definition (6.4) of the 
center-of-mass coordinates. Hence, 


L = M(R x V) + y^OTy(r'y x v'„) = L v + ^l' v . 

y y 


( 6 . 6 ) 


Thus, the angular momentum L can be decomposed into the angular momentum of 
the center of gravity h s = MR x V with the total mass M, and the sum of angular 
momenta of the individual particles about the center of gravity. 

For the torque as the derivative of the angular momentum, the same decomposition 
holds: 


D=D,+y]<. 


(6.7) 
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6.5 Transformation of the Kinetic Energy 

We have 

t =\ X! mv = \ H mv y2 + v ■ H v v + \ H m ^'v 2 - 

V V V V 

Because J2 m vv' v = 0, the middle term again vanishes, and we find 

r = I M V 2 + i^m v v; 2 = 7; + r. (6.8) 

“ V 

The total kinetic energy T is thus composed of the kinetic energy of a virtual particle 
of mass M with the position vector R (t) (the center of gravity), and the kinetic energy 
of the individual particles relative to the center of gravity. Mixed terms, e.g. of the 
form V • V v 2 , do not appear! This is the remarkable property of the center-of-mass 
coordinates, the foundation of their meaning. 


EXERCISE 


6.3 Reduced Mass 


Problem. Show that the kinetic energy of two particles with the masses mi, m 2 
splits into the energy of the center of gravity and the kinetic energy of relative motion. 


Fig. 6.5. Center of gravity and 
relative coordinates of two 
masses 



Solution. The total kinetic energy is 

T = imiv 2 + im 2 v (6.9) 

The center of gravity is defined by 
mjri + m2Y2 

R = -, 

mi + m 2 
and its velocity is 

R=-(mivi+m2V2). (6.10) 

mi + m2 

The velocity of relative motion is denoted by v. We have 


V = Vi — V2. 


( 6 . 11 ) 
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We now express the particle velocity by the center of gravity and relative velocity, 
respectively. 

By inserting V 2 from (6.11) into (6.10), we have 
(mi + m 2 ) R = mi vi + — myv. 


From this, it follows that 
m 2 

vi = RH-v. 

mi + m2 

Analogously, we get 

mi 


V2 : 


R- 


-v. 


mi + m2 

Inserting the two particle velocities into (6.9), we obtain 

2 


T = \ nn 


R + 


m 2 


mi + m 2 


+ ^m2( R - 


mi 


mi + m 2 


or 


2 2 
m\m^y 


^ 1 


1 . 9 1 

T = -MR 2 + -- , , 0 , 

2 2 (mi + m 2) 1 2 (mi + m 2) 1 

1 -9 1 2 

T = -MR 2 + -/xiT. 

2 2 

The mixed terms cancel. The mass related to the center-of-mass motion is the total 
mass M = mi + m 2 ; the mass related to the relative motion is the reduced mass 


fi = 


mi m2 


mi + m 2 

The reduced mass is often written in the form 

1 _ 1 1 

1 1 m\ m2 

It is remarkable that the kinetic energy for two bodies decomposes into the kinetic 
energies of the motion of the center of gravity and of the relative motion. There are no 
mixed terms, e.g., of the form R • v, which considerably simplifies the solution of the 
two-body problem (see the next problem). 


Exercise 6.3 


EXERCISE 


6.4 Movement of Two Bodies Under the Action of Mutual Gravitation 


Problem. Two bodies of masses m\ and m 2 move under the action of their mutual 
gravitation. Let ri and r 2 be the position vectors in a space-fixed coordinate system, 
and r = ri — r 2 . Find the equations of motion for ri, r 2 , and r in the center-of-gravity 
system. How do the trajectories in the space-fixed system and in the center-of-mass 
system look like? 



Fig. 6.6. Laboratory system 
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Solution. Newton’s gravitational law immediately yields 


ri = -- 


Gni2Y 


r 3 ’ r2 = r 3 • 

With the relative coordinate r = ri — r 2 , it follows that 

Gm 2 (ri-r 2 ) , .. G/n i(ri-r 2 ) 
r i =-j- and r 2 =-j-. 

In the center-of-mass system, we have m\X\ = — 1712 X 2 

— G(m\+m2)x\ .. —G(mi+m2)r2 

=>> ri =-«- and r 2 =-«-. 


Gm\v 


G(m\ + m 2 )x 


Subtraction yields 
r = Pi - r 2 = - 
Since 


m 2 , mi 

ri =- r and r 2 =-r, 


mi + m 2 
it follows that 


mi + m2 


—Gm^ri —Gm\X2 

r i = ^- : -^ and r 2 - 


(mi + m 2 ) 2 r\ 


(mi + m2) 2 ^2 


2 r 3 • 


Hence, Newton’s gravitational law holds with respect to the center of gravity, but 
with modified mass factors. This means that the trajectories are conic sections as be- 
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fore (relative path with respect to S ). Because of the superimposed translation of the 
center of gravity, the trajectories become spirals in space. 


EXERCISE 


6.5 Atwoods Fall Machine 


Problem. Two masses ( m\ = 2kg and m 2 = 4kg) are connected by a massless rope 
(without sliding) via a frictionless disk of mass M = 2 kg and radius R = 0.4 m (At¬ 
woods machine). Find the acceleration of the mass m 2 = 4 kg if the system moves 
under the influence of gravitation. 



Solution. For the given masses m\ = 2kg, m 2 = 4kg and the tension forces at the 
rope ends Ni and N 2 , it follows that 

m\ci\ — N\ — m\g, m 2 d 2 = m 2 g — N 2 , (6.12) 

and for the torques acting on the disk, we get 

D x + D 2 = -NiR + N 2 R = R(N 2 - Ni) = d)6 s , (6.13) 

since the disk is accelerated. 0 S is the moment of inertia of the disk. From this, it 
follows that N 2 7 ^ N\ ; otherwise, there is no motion at all. For the accelerations, we 
have 


a = a\ = ci 2 = o)R, 


(6.14) 


since the rope is tight and does not slide, i.e., it adheres to the disk. 

Inserting the moment of inertia of the disk 0 S = MR 2 /2 (see Example 11.7) 
into (6.13) and using (6.14) yields for the acceleration 


N 1 N 2 

a = - g = g -= 

m 1 m 2 


coR = 


R 2 

MR 2 /2 


(N 2 -N 1 ). 


(6.15) 


Exercise 6.4 
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Inserting (6.12) and performing the algebraic steps yields 


a 


2 

M 


(N 2 - Ni) = 


g(m 2 - m\) - m 2 a 2 - m\a\ 

W/2 


and, because a = a\ =a 2 , 


_ aM/2 - g(m 2 - m\) + a(m 2 + m\) 

W/2 

a(m\ + m 2 + M/2 ) — g(m 2 — m\) 

~ W/2 

g(m 2 — m\) 

=> a =-. 

m\ + m 2 + M/2 

The Atwoods machine serves as a transparent and easily controllable demonstration 
of the laws of free fall. By varying the difference of the masses ( m 2 — m i), the accel¬ 
eration a can be varied. 


EXERCISE 


6.6 Our Solar System in the Milky Way 



Problem. Our solar system is about ro ~ 5 • 10 20 m away from the center of the Milky 
Way, and its orbital velocity relative to the galactic center vo is ~ 3 • 10 5 m/s. This is 
schematically shown in Fig 6.9. 

(a) Determine the mass M of our galaxy. 

(b) Discuss the hypothesis that the motion of our solar system is a consequence of the 
contraction of our Milky Way (see Fig. 6.9), and then verify, ro = GM/v^. Here 
G = 6.7 • 10 -11 m 3 s 2 kg -1 is the gravitational constant. 

Solution, (a) If a mass point moves on a circular path, then according to Newton 
the force per unit mass equals the acceleration. Since our sun (mass m) is at the pe¬ 
riphery of our Milky Way, the attractive force toward the center can approximately be 
represented by 


F = G- 


mM 


(6.16) 


where m is the solar mass and M is the mass of the Milky Way. The acceleration 
points toward the center, 



ro 


F 
m ’ 


(6.17) 


from which it follows that 


_o 

ro 


GM 


GM 


or r 0 : 


(6.18) 
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Using the numbers given in the formulation of the problem, one gets from equation 
(6.18) the mass of our Milky Way: 


r 0 v% ~ 5 • 10 20 • 9 • 10 10 
~G~ ~ 6.7-10- 11 


kg = 6.7 • 10 41 kg. 


This means that the mass of the Milky Way is 


M&3- 10 n m, 


where m is the solar mass. 


(b) If r, v are the initial values for the distance and velocity of our sun, for the 
available energies we have 


G Mm 1 2 

kpot =--— and r kin = -mv , 


(6.19) 


where M is the mass of the Milky Way, and G is the gravitational constant. If the 
sun moves with decreasing radius about the center of the Milky Way, the angular 
momentum about the center remains constant; however, the orbital velocity increases. 
Hence, the kinetic energy T^\ n can be given as a function of the radius 


1 l 2 U 2 1 

T= ^ m ~T2 = o-2’ 

2 m z r z 2 m r z 


( 6 . 20 ) 


where we used / = ( mr 2 )co = mvr = constant. 

The assumption is now that at the present distance r the increase in the kinetic 
energy A7ki n is balanced by the decrease in the potential energy if r is reduced by Ar. 
Differentiation of (6.19) and (6.20) with respect to r yields: 


ATkin = 


/ c/7ki n 

\~d7~ 


i 2 i 

Ar =- t-A r, 

m r 5 


AVpot = 



GMm 

r 2 


Ar, 


ATkin > 0, if Ar < 0, 
AVpot < 0, if Ar < 0. 


In the equilibrium, however, A7ki n + A V pot = 0. Replacing / by l = mvoro yields 


2 2 2 
m Wo 

mr q 



or ro^o = MG. 


( 6 . 21 ) 


Exercise 6.6 


Equation (6.21) again corresponds exactly to the result of problem (a). 













Vibrations of Coupled Mass Points 


/ 


As the first and most simple system of vibrating mass points, we consider the free 
vibration of two mass points, fixed to two walls by springs of equal spring constant, 
as is shown in the Fig. 7.1. 


Fig. 7.1. Mass points coupled 
by springs 


— '-&UUUXT' -(T)—T 50 CWT—(2)—TOTT— 


-kx l k(x 2 -x l ) 




J) —wp— 0-^Kmr—« 


The two mass points shall have equal masses. The displacements from the rest 
positions are denoted by x\ and * 2 , respectively. We consider only vibrations along 
the line connecting the mass points. 

When displacing the mass 1 from the rest position, there acts the force — kx\ by the 
spring fixed to the wall, and the force +k(x 2 — x\) by the spring connecting the two 
mass points. Thus, the mass point 1 obeys the equation of motion 


mx 1 = —kx 1 + k(x 2 — x\). 
Analogously, for the mass point 2 we have 


(7.1a) 


mx 2 = —kx 2 — k(x 2 — x\). (7.1b) 

We first determine the possible frequencies of common vibration of the two particles. 
The frequencies that are equal for all particles are called eigenfrequencies. The re¬ 
lated vibrational states are called eigen- or normal vibrations. These definitions are 
correspondingly generalized for a A-particle system. We use the ansatz 


X\ = A \ COS COt, X 2 = A 2 cos cot, (7.2) 

i.e., both particles shall vibrate with the same frequency co. The specific type of the 
ansatz , be it a sine or cosine function or a superposition of both, is not essential. 
We would always get the same condition for the frequency, as can be seen from the 
following calculation. 
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Insertion of the ansatz into the equations of motion yields two linear homogeneous 
equations for the amplitudes: 

A\(—moo 2 + 2k) — A^k = 0, 

9 (7.3) 

—A\k + A'li—mod 1 + 2k) = 0. 


The system of equations has nontrivial solutions for the amplitudes only if the deter¬ 
minant of coefficients D vanishes: 


D = 


—moo 2 + 2k 
—k 


—k 

—moo 2 + 2k 


(—moo 2 + 2k) 2 — k 2 — 0. 


We thus obtain an equation for determining the frequencies: 

a k 9 k 2 

co 4 — 4—co 2 + 3— 9 = 0. 
m m l 

The positive solutions of the equation are the frequencies 

[3k t IT 

oo \ = ,/ — and 002 = J —. 

V m V m 

These frequencies are called eigenfrequencies of the system; the corresponding vibra¬ 
tions are called eigenvibrations or normal vibrations. To get an idea about the type 
of the normal vibrations, we insert the eigenfrequency into the system (7.3). For the 
amplitudes, we find 

A\ = —A 2 for 00 \ = 



and 


A\ = A 2 for 0)2 = 

The two mass points vibrate in-phase with the lower frequency 002 , and with 
the higher frequency 00 \ against each other. The two vibration modes are illustrated 
by Fig. 7.2. 



Fig. 7.2. 


—iHQQQQ;—^^^—dlQQQJb— 

1-► «-1 


1-► 1-► 


<-1 1-► 

co-i: opposite-phase vibration 


^—1 «—1 

ol> 2 : in-phase vibration w-| > ca 2 . 


The number of normal vibrations equals the number of coordinates (degrees of 
freedom) which are necessary for a complete description of the system. This is a con¬ 
sequence of the fact that for N degrees of freedom there appear N equations of the 
kind (7.2) and N equations of motion of the kind (7.1a), (7.1b). This leads to a de¬ 
terminant of rank N for 00 2 , and therefore in general to N normal frequencies. Since 
we have restricted ourselves in the example to the vibrations along the v-axis, the two 
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coordinates x\ and X2 are sufficient to describe the system, and we obtain the two 
eigenvibrations with the frequencies mi, m2. 

In our example, the normal vibrations mean in-phase or opposite-phase (= in-phase 
with different sign of the amplitudes) oscillations of the mass points. The amplitudes 
of equal size are related to the equality of masses ( m\ = m 2 ). The general motion 
of the mass points corresponds to a superposition of the normal modes with different 
phase and amplitude. 

The differential equations (7.1a), (7.1b) are linear. The general form of the vibration 
is therefore the superposition of the normal modes. It reads 

X\(t) = C 1 COS (cO\t + (pi ) + C*2 COS(cl>2^ + <Pl), 

(7.4) 

X 2 (0 = -Cl COS((i>it + (pi) + C2 COS (m2 + (P2)- 

Here, we already utilized the result that x\ and X2 have opposite-equal ampli¬ 
tudes for a pure mi -vibration, and equal amplitudes for pure m2 -vibrations. This en¬ 
sures that the special cases of the pure normal vibrations with C 2 = 0, Ci ^ 0 and 
Cl = 0 , C2 / 0 are included in the ansatz (7.4). Equation (7.4) is the most general 
ansatz since it involves 4 free constants. Thus one can incorporate any initial values 
for v 1 (0), X2 (0) , x 1 (0), X2 (0). 

For example, the initial conditions are 


xi(0) = 0, X2(0) = a, ii(0) = X2(0) = 0. 

To determine the 4 free constants Ci, C 2 , <pi, <P2, we insert the initial conditions 
into the equations (7.4) and their derivatives: 


xi( 0 ) = Ci cos^i + C 2 Cos <^2 = 0, (7.5) 

X 2 (0) = — Ci cos (p\ C 2 cos (p 2 — ci 5 (7.6) 

i^O) = — Cimi sin^i — C 2 C 02 sin ^2 = 0, (7.7) 

i 2 (0) = Ci mi sin^i — C 2 C 02 sin ^2 = 0- (7.8) 

Addition of (7.7) and (7.8) yields 
C 2 sin (p2 = 0 . 

Subtraction of (7.7) and (7.8) yields 


Ci sin^i = 0. 


From addition and subtraction of (7.5) and (7.6), it follows that 
2 C 2 cos <p2 — & and 2Ci cos^i = — a. 

Thus, one obtains 


Ci = 


a 


2 ’ 



(Pl =( P2 = 0 , 
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The overall solution therefore reads 


x\(t) — — (— cos co\t + coso^O = a sin 


X 2 (t) = — (cos co\t + cosco 2 t) = 


a cos^ 


00 1 — 002 
2 

00 1 - £02 


£ sin 


00\ + 002 


t , 


/ft)l +<M 2 

£cos - L 


For t = 0: vi(0) = 0, X2(0) = a, as required. The second mass plucks at the first one 
and causes it to vibrate. These are beat vibrations (see Exercise 7.2). 


EXERCISE 

7.1 Two Equal Masses Coupled by Two Equal Springs 


Problem. Two equal masses move without friction on a plate. They are connected 
to each other and to the wall by two springs, as is indicated by Fig. 7.3. The two 
spring constants are equal, and the motion shall be restricted to a straight line (one¬ 
dimensional motion). Two equal masses coupled by two equal springs. 

Find 

(a) the equations of motion, 

(b) the normal frequencies, and 

(c) the amplitude ratios of the normal vibrations and the general solution. 


Fig. 7.3. 


k m k m 

■—TRXftftr—(7)—— ( 2 ) 

1 -► 1 -► 

Vi * 2 


Solution, (a) Let x\ and X 2 be the displacements from the rest positions. The equa¬ 
tions of motion then read 

mx 1 = —kx 1 + k(x 2 — x\), (7.9) 

mx 2 = —k(x 2 — x\). (7.10) 

(b) For determining the normal frequencies, we use the ansatz 
X\=A\ COS cot, X2 = A2 cos cot 


and thereby get from (7.9) and (7.10) the equations 

(2k — mco 1 2 * * * * * * 9 )A\ — kA 2 = 0, 

9 (7.11) 

—kA\ + (k — mco z )A2 = 0. 


From the requirement for nontrivial solutions of the system of equations, it follows 
that the determinant of coefficients vanishes: 


2 k — moo 2 
—k 


= 0 . 


D = 


—k 

k — moo 2 
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From this follows the determining equation for the eigenfrequencies, 

A k ^ k 2 

(o — 3 —co + —-z — 0, 
m m l 

with the positive solutions 

75 +1 IT , 75-1 [T 

co\ = - J— and 002 =- A /—, co\ > a> 2 . 

2 V m 2 V m 

(c) By inserting the eigenfrequencies in (7.11) one sees that the higher frequency 
co\ corresponds to the opposite-phase mode, and the lower frequency 002 to the equal- 
phase normal vibration: 


with 


with 


»3 = I(3-V5)1. 


it follows from (7.11) that 


it follows from (7.11) that 


M = ~ 


V5-1 


A ^ +l A 

A 2 = —-— A\. 


Since the two mass points are fixed in different ways, we find amplitudes of different 
magnitudes. 

The general solution is obtained as a superposition of the normal vibrations, using 
the calculated amplitude ratios: 


X\{t) = C 1 COS(cO\t + cp\) + C2COS(c02t + (f> 2 ), 

V5-1 75 +1 

X2(t) =---Cl COS (00\t + (pi) H--- C2C0S(C02t + (^ 2 )- 

The 4 free constants are determined from the initial conditions of the specific case. 


Exercise 7. / 


EXERCISE 


7.2 Coupled Pendulums 

Problem. Two pendulums of equal mass and length are connected by a spiral spring. 
They vibrate in a plane. The coupling is weak (i.e., the two eigenmodes are not very 
different). Find the motion with small amplitudes. 




Solution. The initial conditions are 


xi (0) = 0, *2(0) = A, 


ii(0) =*2(0) =0. 
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\ 

We start from the vibrational equation of the simple pendulum: 


ml a = —mg sin o'. 


For small amplitudes, we set sin a = a = x/1 and obtain 


g 

i 

Fig. 7.5. 

mx — —m—x. 

1 

For the coupled pendulums, the force =F^(-*l — * 2 ) caused by the spring still enters, 
which leads to the equations 

Xl = -jX l --(x l -X 2 ), 

1 ™ (7.12) 

g k 

X2 = -~X2 H- (X\ -X2). 

1 m 

This coupled set of differential equations can be decoupled by introducing the coordi¬ 
nates 

u\ = x\ — X 2 and U 2 =x\-\-X 2 . 

Subtraction and addition of the equations (7.12) yield 

8 (8 , r. k \ 

u 1 = — — u\ —2 — u\ — — I — + 2— I mi, 

/ m \ l m ) 

g 

U2 = -yM2. 

These two equations can be solved immediately: 

u\ — A\ coso;^ + B\ sinooit, 

(7.13) 

U2 = A 2 COSC02t + B 2 sin6t>2^ 

where co\ = y/g/l + 2{k/m ), C 02 = \Z~gJl are the eigenfrequencies of the two vibra¬ 
tions. The coordinates u\, U 2 are called normal coordinates. Normal coordinates are 
often introduced to decouple a coupled system of differential equations. The coor¬ 
dinate mi = x\ — X 2 describes the opposite-phase and M 2 = x\ + X 2 the equal-phase 
normal vibration. The equal-phase normal mode proceeds as if the coupling were ab¬ 
sent. 

For sake of simplicity, we incorporate the initial conditions in (7.13). For the nor¬ 
mal coordinates we then have 

U\(0) = —A, M2(0) = A, Mi(0) = M2(0) = 0. 

Insertion into (7.13) yields 

A\ = —A, A 2 = A, B 1 = B 2 = 0, 

and thus, 


u\ = — Acosco\t, 


U2 = Acosco2t. 
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Returning to the coordinates x\ and X 2 '- 
1 A 

X\ = -(u l + U2) = — (— COSOq£ + COS(L)2t), 

1 A 

X2 = -(m 2 — w i) = ~( cos< ^l t + coso)2t). 
After transforming the angular functions, one has 


x\ = A sin 


(Ol - 002 


t sm 


oo\ + 002 


2 

00 \ + 002 


t , 


(OOl- 002 \ {(01+0)2 \ 

X 2 = A cos I -—- 1 I cos I -—- 1 I 


We have presupposed the coupling of the two pendulums to be weak, i.e., 



hence, the frequency oo\ — 002 is small. The vibrations xi (t) and X 2 (t) can then be 
interpreted as follows: The amplitude factor of the pendulum vibrating with the fre¬ 
quency 001 + 002 is slowly modulated by the frequency 001 — 002 . This process is called 
beat vibration. Figure 7.6 illustrates the process. The two pendulums exchange their 
energy with the amplitude modulation frequency co\ — 002 . If one pendulum reaches 
its maximum amplitude (energy), the other pendulum comes to rest. This complete 
energy transfer occurs only for identical pendulums. If the pendulums differ in mass 
or length, the energy transfer becomes incomplete; the pendulums vary in amplitudes 
but without coming to rest. 



Fig. 7.6. 
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7.1 The Vibrating Chain 

We consider another vibrating mass system: the vibrating chain. The “chain” is a mass¬ 
less thread set with N mass points. All mass points have the mass m and are fixed to 
the thread at equal distances a. The points 0 and A + 1 at the ends of the thread are 
tightly fixed and do not participate in the vibration. The displacement from the rest po¬ 
sition in y-direction is assumed to be relatively small, so that the minor displacement 
in v-direction is negligible. The total string tension T is only due to the clamping of 
the end points and is constant over the entire thread. 

If one picks out the vth particle, the forces acting on this particle are due to the dis¬ 
placements of the particles (v — 1) and (v + 1). According to Fig. 7.7 the backdriving 
forces are given by 

F v _i = —{T • sino')e2, 

F v -}-i = — (T • sin/?)e 2 . 


Fig. 7.7. 



Since the displacement in y-direction is small by definition, a and are small 
angles, and hence, one has, to a good approximation, 

sin a = tan a and sin = tan p. 

From Fig. 7.7, one sees that 

yv — yv —i , Q yv — jv+i 

tan a =- and tan p =-. 

a a 

Hence, the forces are given by 


^ rj, I Jv yv -1 

Fy -1 = -T\ - e 2 , 


TTI rr I y» Jv+1 

F v +i = -T\ - )e 2 . 


1 It is recommended that the reader go through Chap. 8 (“The Vibrating String”) before studying 
this section. The concepts presented here will be more easily understood, and the mathematical ap¬ 
proaches will be more transparent in their physical motivation. 















7.1 The Vibrating Chain 


89 


The total backdriving force is the sum F y _i + F v +i, i.e., the equation of motion for 
the particle reads 


dy v rr(yv~yv -1 

m ^=~ T 


e2 ■ 


Jv - Jv+l 

a 


e2 


or 


d 2 y v 
dt 2 


— (y v _ i — 2y v + y v +i). 
ma 


(7.14) 


Since the index v runs from v = 1 to v = N, one obtains a system of N coupled dif¬ 
ferential equations. Considering that the endpoints are fixed, by setting for the indices 
v = 0 and v = N + l 


yo = 0 and yw+l =0 (boundary condition), 


one obtains from the differential equation (7.14) with the indices v = 1 and v = N the 
differential equation for the first and last particle that can participate in the vibration: 


d 2 yi 


m 


d^yN 

dt 2 


-(-2yi +y 2 ), 

a 

T 

— (yN-i — 2yiv). 
a 


(7.15) 


We now look for the eigenfrequencies of the particle system, i.e., the frequencies 
of vibration common to all particles. To get a determining equation for the eigenfre- 
quency co n , we introduce in (7.14) the ansatz 


y v ~ A v cos cot. 


(7.16) 


We obtain 


—mco z • A v • cos cot = — (A v -i — 2 A v + A v +\) cos cot, 
a 


and after rewriting, 
—A v -1 + 


( 2 - 


maco 


A v - A v+ i =0, v = 2, 1. 


(7.17a) 


By insertion of (7.16) into (7.15), we get the equations for the first and the last vibrat¬ 
ing particle: 


maco 


-An-i + ( 2 - 


maco 


Ai - A 2 = 0, 

2n 


(7.17b) 


An = 0 . 


With the abbreviation 
2 T — maco 2 


= c. 


(7.18) 
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(7.17a) and (7.17b) can be rewritten as follows: 

0 
0 
0 

— An -i + cAn= 0. 

This is a system of homogeneous linear equations for the coefficients A v . For any 
nontrivial solution of the equation system (not all A v = 0) the determinant of coeffi¬ 
cients must vanish. This determinant has the form 


c 

-1 

0 

0 

0 ... 

0 

0 

0 

-1 

c 

-1 

0 

0 ... 

0 

0 

0 

0 

-1 

c 

-1 

0 ... 

0 

0 

0 

0 

0 

0 

0 

0 ... 

-1 

c 

-1 

0 

0 

0 

0 

0 ... 

0 

-1 

c 


It has N rows and N columns. The eigenfrequencies are obtained as solution of the 
equation 

Dn = 0 . 

Expanding Dn with respect to the first row, we get 


c 

-1 

0 


0 

0 

0 

-1 

c 

-1 


0 

0 

0 

0 

-1 

c 


0 

0 

0 

0 

0 

-1 


0 

0 

0 

0 

0 

0 


-1 

0 

0 

0 

0 

0 


c 

-1 

0 

0 

0 

0 


-1 

c 

-1 

0 

0 

0 


0 

-1 

c 

-1 

-1 

0 

0 


0 

0 

0 

c 

-1 

0 


0 

0 

0 

-1 

—c 

-1 


0 

0 

0 

0 

-1 

c 


0 

0 

0 

0 

0 

0 


-1 

0 

0 

0 

0 

0 


c 

-1 

0 

0 

0 

0 


-1 

c 


The left-hand determinant has exactly the same form as Dn, but is lower by one 
order (TV — 1 rows, TV — 1 columns). It would be the determinant of coefficients for 
a similar system with one mass point less, i.e., Dn-\. The right-hand determinant is 


cA\ - A 2 
—A\ + cA 2 — A 3 

— A 2 + cA 3 — A\ 
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now expanded with respect to the first column, which leads to 


D n = cD n _\ + (-1) • 


c 

-1 

0 ... 

0 

0 

-1 

c 

-1 ... 

0 

0 

0 

-1 

c 

0 

0 

0 

0 

0 ... 

-1 

0 

0 

0 

0 ... 

c 

-1 

0 

0 

0 ... 

-1 

c 


The last determinant is just £>#-2- Hence we get the determinant recursion equa¬ 
tion 


Dn = cDn-i — Dn—2 > if TV > 2. 
Moreover, 

-1 


D] = \c\ = c and D 2 = 


-1 


= L — 1 . 


(7.19) 


(7.20) 


By setting TV = 2 in (7.19), we recognize that (7.19) combined with (7.20) is satisfied 
only if we formally set 

A) = l. (7.21) 

Our problem is now to solve the determinant equation (7.19). We use the ansatz 
Dn = P N , 

where the constant p must be determined. Insertion into (7.19) yields 

p N = C p N - l -pN-\ 


N—2 


and after division by p 


p — cp +1=0 or p 


■ zb a/c 2 — 4 


The mathematical possibility p N ~ 2 = 0 that leads to p = 0 does not obey the bound¬ 
ary condition Dq = \ and is therefore inapplicable. Substituting c — 2 cos ©, we obtain 
for p 


> = cos © d= yj cos 2 © — 1 = cos © =b i sin © = e 


±i© 


The solutions of (7.19) are then 

D N = p N = 0 e m ) N = e iN0 = cos TV© + i sin TV© 
and 

= (^ ,w ) r 


= cos TV© — i sin TV©. 
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Since the equation system (7.19) is homogeneous and linear, the general solution is a 
linear combination of cos N& and sin N@: 


D n = G cos TV© + H sin TV©. (7.22) 

Since Do = 1 and D\—c — 2 cos © (see above), G and H are determined as 
G = 1, 77 = cot ©, 

so that 

sin Af© cos© sinW + l)© 

D n =cos7V© +-= —--—, 

sin © sin © 

because sin © cos N® + sin NS cos © = sin(N + 1)©. 

For any nontrivial solution of the equation system we must have Dn = 0, i.e., Dn 
must vanish for all N; it follows that 

sin((7V + l)©) = 0, 


or 

® = ®n = n = l,...,N. (7.23) 

n — 0 drops out since it leads to the solution ©o = 0, and hence to Dn — N + 1 
^ 0, and thus does not lead to a solution of the equation Dn = 0. For c we then get 
according to (7.18): 


c = 2 — 


co 2 ma 

T 


= 2 cos 


nir 

N + l' 


and co is calculated from 


IT 


co =ft> (n) = 


ma 


( nix \ 
1 — cos- 

N + lJ 


as 


0)(n) = 1 


127T 

1 1 — cos-. 

ma V N + l 


(7.24a) 


(7.24b) 


These are the eigenfrequencies of the system ; the fundamental frequency is obtained 
for n = 1 as the lowest eigenfrequency. There are exactly N eigenfrequencies, as is 
seen from (7.23): For n > Af + 1, we set n = (N + 1) + t and find 


If one inserts the above expression into (7.17a) and (7.17b) for co and c, respectively, 
one obtains for the amplitudes of the normal vibration 


-A++2Ay 


cos ■ 


nit 

aTT"i 


— A 


(n) 


2 cos 


2cos 


v+l 

nix 

N + l 
nix 

N + l 


0, 


i(«) 


iV—1 


(7.25) 
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where the A v depend on n (A v = A jf^). The system of equations (7.25) for the A v 
is the same as that for the determinants Dn (equation (7.19)), with the same coeffi¬ 
cient c = 2cosmt/(N + 1) = 2cos0 n . Only the boundary conditions (7.25) do not 
correspond to those for the D # (see (7.20) and (7.21)). The general solution for the 
coefficients A v is therefore obtained from (7.22) with at first arbitrary coefficients 
E {n) : 

A^ = cos v 0 n + E^ sin v 0 „, 


or, in detail, 


A^ = E\ n) cos n7TV + E^ sin 

v 1 AT _L 1 2 


A + 1 


/urv 

7v + r 


(7.26) 


Since the points v = 0 and v = TV + 1 are tightly clamped, for all eigenmodes n we 
have yo = yw+i = 0 , or 


A = A$ +l = 0 (boundary condition). 


Then one obtains for v = 0 in (7.26): 


= 0 , i.e., A< n) 


sin 


nix v 

n a r 


After insertion into (7.16), one gets 

y^ = E^ sin-cos coaot. 

2 A + l 


(7.27) 


If one inserts y v = B v sin cot instead of (7.16) into (7.14), one determines B v by the 
same method as A v and obtains 


B (n) _ E (n) j (£ W = 0 ). 

v 4 N + l 3 

hence, the solutions for the y v read 

= E^ sin-cos coaot 

yv 2 7V + 1 w 

and 


( n \ (n) . ^TT V 

yf* = E\ sin sin co (n) t. 


(7.28a) 


(7.28b) 


The sum of these individual solutions yields the general solution, which therefore 
reads 


N 

E nirv / \ 

sin——— (£4 sm<w (n) r + E\ 'cos co(„)t) 

n =1 


N 

E nirv . 

sm N ^ (a n sin co {n) t + b n cos (o in )t), 

n =1 


(7.29) 


where the constants E^ and £ 4 ^ were renamed b n and respectively. They are 
determined from the initial conditions. 
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7 Vibrations of Coupled Mass Points 


The equation of the vibrating chord must follow from the limit for N —> oc and 
a -> 0 (continuous mass distribution): 


sin 


mtv 

V + 1 


lim 

N-^-oo 
a —>0 



\ 

l -\- d ) 


sin 


sin 


sin 


nnav 
(N + l)a 

7tn(av) 
l -\- CL 

irnx 

~r 


(x v = av takes only discrete values) 
(/ = Na is the length of the chord) 
(x continuous). 


co 2 ^ becomes (expansion of the cosine in (7.24a) in a Taylor series): 
2 _ 2 T / 1 / nn \ 2 \ r(» 7 r) 2 

^ may 2\N + l) J (m/a)(N + l) 2 a 2 ’ 


and with cr =m/a = mass density of the chord, 
/ T{mt) 2 \ _ T(nnf 

n^oo\<j(N + l) 2 a 2 J al 2 

a —>0 


i.e., 


CO(n) 



Hence, one has as a limit 


y n (x) = sin 


mrx \ 

) 



(7.30) 


This is the equation for the nth eigenmode of the vibrating chord (/ is the chord length). 
It will be derived once again in the next chapter in a different way and will then be 
discussed in more detail. 


EXERCISE 


7.3 Eigenfrequencies of the Vibrating Chain 


Problem. When solving the determinant equation (7.19), we have made a mathemat¬ 
ical restriction for c by setting c = 2 cos ©. 

Show that for the cases 

(a) k| = 2, 

(b) c < -2 

the eigenvalue equation = 0 cannot be satisfied. Clarify that thereby the special 
choice of the constant c is justified. 

Solution, (a) 


Dn — cD n —\ D n — 2, 


Di=c = ± 2 , 


A) = l. 


( 7 . 31 ) 
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We assert and prove by induction 

|D n |>|D n _i|. (7.32) 

Induction start: n = 2, |Z>o| = 1, \D\\=2, |Z> 2 |= 3 . 

Induction conclusion from n — l, n — 2 to n: 

I Ail 2 = 4| A-il 2 ±4| A-ill A-il +1A 1 - 2 I 2 

>4|D„_i| 2 + |D„_ 2 | 2 -4|D b _i||£>„_ 2 | 

=► IA I 2 - \D n -i\ 2 > 3|Z)„_! | 2 + |D „_ 2 | 2 - 4|D„_ 1 ||D„_ 2 |. 

According to the induction condition, 
lAz-ll = IA 1 - 2 I + e with e > 0 . 


From this, it follows that 

|D„| 2 -|L»„_i| 2 >4|D„_2 | 2 + 6e|D„_ 2 |+3e 2 -4e|Z)„_2|-4|Z)„_2 | 2 
>2e|D„_ 2 | 

>0 

=► \D n \>\D n -i\. (7.33) 


Since | D n | monotonically increases in n , and | D\ | = 2 > 0, we have | | >0. There¬ 

fore Dn = 0 cannot be satisfied, co = 0 and co = «JlT/ma are not eigenfrequencies 
of the vibrating chain. 

(b) By inserting the ansatz D n = Ap n , p ^ 0, we also find the solution of the 
recursion formula D n = cD n -\ — D n - 2 , D\ = c. Do = 1: 


/ ? i = i(c + (c 2 -4) 1 /2)<ol 
/D = 3 (c - (c 2 -4) 1/2 ) < 0 } 


0 > Pl > P2- 


(7.34) 


The general solution for incorporating the boundary conditions Do = 1, D\=c reads 


D„ =Aip1 +A 2 P2- 


(7.35) 


With Do = 1, D| = c, it follows that 


A\ + A 2 = 1, 

^(c + <c 2 -4) 1 ' 2 ) + £(c-(c 2 -4)>' 2 ) = c, 
c + (c 2 -4 ) x / 2 

A,= W 2 -4)'/2 * 2 


-c + (c 2 — 4 ) 1 / 2 

2(c 2 -4)!/2 


. (7.36) 


One then has 

„ _ 1c + (c 2 -4)!/2 „ _ 1 (c 2 — 4) 1 / 2 — C „ 
" 2 (c 2 — 4)1/2 A+ 2 (c 2_ 4)1/2 A 

1 / n +1 11+14 


(c 2 — 4 ) 1 / 2 


(K + 1 -t 2 " +1 )- 


Exercise 7.3 


(7.37) 
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7 Vibrations of Coupled Mass Points 


To determine the physically possible vibration modes, we had required that = 0: 


On = 0 



(7.38) 


But now 0 > p\ > p 2 , hence (p2/pi) N + l > 1. Thus, for the case c < —2 eigenfre- 
quencies do not exist too. 

These supplementary investigations can be summarized as follows: The possible 
eigenfrequencies of the vibrating chain lie between 0 and A J2T/ma: 


0 < \(o\ < 



(7.39) 


EXERCISE 


7.4 Vibration of Two Coupled Mass Points, Two Dimensional 


Problem. Two mass points (equal mass m ) lie on a frictionless horizontal plane and 
are fixed to each other and to two fixed points A and B by means of springs (spring 
tension T, length /). 

(a) Establish the equation of motion. 

(b) Find the normal vibrations and frequencies and describe the motions. 



Fig. 7.9. 


r 




4 


o>* = 3k 




0)2 — /f 


Solution, (a) For the vibrating chain with n mass points, which are equally spaced 
by the distance l, the equations of motion 

d 2 y N T 

df 2 = ~ ^ yN + yA ' +1 ' ) (^ = 1 , ...,n) 

were established. For the first and second mass point, we have 


yi = k{y o - 2yi + = k(y 2 - 2yi), 

h = k(y\ - 2y 2 + y 3 ) = k(y i - 2y 2 ) 


(7.40) 


7.1 The Vibrating Chain 
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with k = T/ml\ the chain is clamped at the points A and B, i.e., = 0. 

(b) Solution ansatz : yi = Ai coscot, y 2 = A 2 cos cot (co = eigenfrequency). Inser¬ 
tion into (7.40) yields 


(2k — co 2 )A\ — kA2 = 0, 
(2k - co 2 )A 2 - kAi = 0 . 


(7.41) 


To get the nontrivial solution, the determinant of coefficients must vanish, i.e., 


D = 


2k — co 2 
—k 


—k 

2k — co 2 


= 0 ; 


i.e., co 4 + 3 k 2 — 4 kco 2 — 0 , from which it follows that co 2 = 3k, co\ — k. 

Insertion in (7.41) yields A\ = A 2 for C 02 and A\ = — A 2 for co\. This is an 
opposite-phase and an equal-phase vibration, respectively. We note that the vibration 
with the higher frequency has opposite phases and a “node,” while the vibration with 
lower frequency has equal phases and a “vibration antinode.” 


EXERCISE 

7.5 Three Masses on a String 


Problem. Three mass points are fixed equidistantly on a string that is fixed at its 
endpoints. 

(a) Determine the eigenfrequencies of this system if the string tension T can be con¬ 
sidered constant (this holds for small amplitudes). 

(b) Discuss the eigenvibrations of the system. Hint : Note Exercises 8.1 and 8.2 in 
Chap. 8 . 



Fig. 7.10. 


Solution, (a) For the equations of motion of the system, one finds straightaway 



= 0 , 

= 0, (7.42) 

= 0 . 


Assuming periodic oscillations, i.e., solutions of the form 


x\ = A sin(&tf + \[r), 
X 2 = B sin (cot + t/0, 
X 3 — C sin(&tf + V0, 


x\ = —co 2 A sin (cot + V0, 
X 2 = —co 2 B sin^ + V0, 
jc‘ 3 = —co 2 C sin (cot + V0, 
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7 Vibrations of Coupled Mass Points 


Exercise 7.5 


we get after insertion into (7.42) 


- (jo 2 m I A — I — ) B = 0, 


-0 


(7.43) 


(^5- -co 2 tnjc = 0 . 


As in Exercise 8.2, one gets the equation for the frequencies of the system from the 
expansion of the determinant of coefficients: 




GT+ ■ 


10Lm 


-a/ - 4 = 0 


or 


/ Lm \ 3 , / Lm \ 2 7 lOLm 

(—) S2 3 -6(—J ^+—-^i-4 = 0 

with £2 = co 2 . This cubic equation with the coefficients 


(7.44) 


a = ^ 


Lm\ 3 

~Y) ’ 


■At) 1 - 


10 Lm 


d = —4 


can be solved by Cardano’s method. 
With the substitutions 


y — £2 + 


3 a 


1 b 2 c _ T 2 

P 3 a 2 a L 2 m 2 ’ 


2 b 3 l be d 

2 ^ — - — — 0 
27 a* 3 a A a 


we get q 2 + p 3 < 0 , i.e., there are three real solutions which by using the auxiliary 
quantities 

q 

cos <p = -^== 0 , yi = 

V-p 3 

y2 = 

y3 = 


can be calculated as 



-2V^cos(| + | 


= 0 , 


2 +J—p cos — = a /2 - 

' 22 /rn 


(b) From the first and third equation of (7.43), one finds for the amplitude ratios 


B B ^ mLco 2 
A = C = T 


(7.45) 
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Discussion of the modes: 

(1) 0 ) = C 01 = ( 0.6T/Lm)*/ 2 inserted into (7.45) =>• Bi/Ai = B\/C\ = 1.4 or B { = 
1.4 A i = 1.4Ci. 

All three masses are deflected in the same direction, where the first and third 
mass have equal amplitudes, and the second mass has a larger amplitude. 

(2) co = co2 — (2 T / Lm ) 1 / 2 inserted into (7.45) =>► B2/A2 = B2/C2 = 0 and A2 = 
—C2 from the second equation of (7.43). The central mass is at rest, while the first 
and third mass are vibrating in opposite directions but with equal amplitude. 



Fig. 7.11. 



Fig. 7.12. 


(3) co = (02 = (3AT/Lm ) 1 / 2 inserted into (7.45) =>► B 3 /A 3 = B 3 /C 3 = —1.4, i.e., 
A3 = C3 = — 1.4£3. The first and the last mass are deflected in the same direction, 
while the central mass vibrates with different amplitude in the opposite direction. 

The system discussed here has three vibration modes with 0, 1, and 2 nodes, re¬ 
spectively. For a system with n mass points, both the number of modes as well as 
the number of possible nodes (n — 1) increases. A system with n -> 00 is called a 
“vibrating string.” 

A comparison of the figures clearly shows the approximation of the vibrating string 
by the system of three mass points. 



^3 



1.4A 3 


Fig. 7.13. 



Fig. 7.14. 


EXERCISE 


7.6 Eigenvibrations of a Three-Atom Molecule 


Problem. Discuss the eigenvibrations of a three-atom molecule. In the equilibrium 
state of the molecule, the two atoms of mass m are in the same distance from the 
atom of mass M. For simplicity one should consider only vibrations along the mole¬ 
cule axis connecting the three atoms, where the complicated interatomic potential is 
approximated by two strings (with spring constant k). 
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7 Vibrations of Coupled Mass Points 


Exercise 7.6 


(a) Establish the equation of motion. 

(b) Calculate the eigenfrequencies and discuss the eigenvibrations of the system. 


m M m 

•——o——• 

Fig. 7.15. 


Solution, (a) Let x \, X 2 , *3 be the displacements of the atoms from the equilibrium 
positions at time t. From Newton’s equations and Hooke’s law then it follows that 


mx i = —k(x i — JC 2 ), 


Mx 2 = —k(x 2 — X3) — k(x 2 — x\) = k(x 3 + x\ — 2 x 2 ), 


(7.46) 


mx 3 = — k(x 3 — X2). 


m M m 





X l X 2 X 3 

Fig. 7.16. 


(b) By inserting the ansatz x\ = a\ cos cot, X 2 = ^2 cos cot, and X 3 = <23 cos oot 
into (7.46), one obtains 


(moo 2 — k)a\ + ka 2 = 0 , 

ka\ + (Moo 2 — 2 k)ci 2 + ka?> = 0, 

kci 2 + (moo 2 — k)ci 2 — 0 . 


(7.47) 


The eigenfrequencies of this system are obtained by setting the determinant of 
coefficients equal to zero: 


moo 2 — k k 0 

k Moo 2 — 2k k 

0 k moo 2 — k 


: 0 . 


From this, it follows that 

(moo 2 — k)[oo 4 mM — oo 2 (kM + 2km)] = 0 


(7.48) 


(7.49) 


or 


oo 2 (moo 2 — k)[oo 2 mM — k(M + 2m)] = 0. 


By factorization of (7.49) with respect to 00 , one obtains for the eigenvibrations of the 
system: 



Discussion of the vibration modes: 

(1) Insertion of 00 = oo\ = 0 into (7.47) yields a\ = <22 = <23. The eigenfrequency 
00 1 = 0 does not correspond to a vibrational motion, but represents only a uni¬ 
form translation of the entire molecule: 

(2) Inserting 00 = 002 = (k/m) 1 ^ 2 into (7.47) yields a\ = —<23, <22 = 0; i.e., the central 

atom is at rest, while the outer atoms vibrate against each other: o . 

(3) Inserting 00 = 003 = {k/m( 1 + 2m/M)} 1 / 2 into (7.47) yields a\ = (23, ci2 = 
— (2m/M)a\, i.e., the two outer atoms vibrate in phase, while the central atom 
vibrates with opposite phase and with another amplitude: 









A string of length l is fixed at both ends. Thereby appear forces T that are constant 
in time and independent of the position. The string tension acts as a backdriving force 
when the string is displaced out of the rest position. A string element As at the position 
x experiences the force 

F y (x) = — T sin©(x) 

in y-direction. At the position x + Ax there acts in y-direction the force 
F y (x + Ax) = T sin©(x + Ax). 

In y-direction, the string element As experiences the total force 

F y — T sin@(x + Ax) — T sin©(x). (8.1) 

Accordingly, along the x-direction the string element As is pulled by the force 
F X = T cos ©(x + Ax) — T cos ©(x). 

In a first approximation we assume that the displacement in x-direction shall be zero. 
A displacement of the string in y-direction causes only a very small motion in the 
x-direction. This displacement is negligible compared to the displacement in 
y-direction, i.e., 

F x = 0. 


Since we neglect the displacement in x-direction, the only acceleration component of 
the string element is given by 3 2 y/dt 2 . The mass of the element is m = a As, where 
o represents the line density. From that and by means of (8.1) we obtain the equation 
of motion: 


Fy 



T sin ©(x + Ax) — T sin ©(x). 


( 8 . 2 ) 



Fig. 8.1. The string tension T 
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8 The Vibrating String 


Both sides are divided by Ax: 

a Asd 2 y T sin©(x + Ax) — T sin©(x) 

A xdt 2 Ax 

Inserting for As in the left-hand side of (8.3) 

As = yj' Ax 2 + Ay 2 , 
one has 

ay/Ax 2 + Ay 2 3 2 y l~ 7 Ay\ 2 3 2 y 

Ax dt 2 y \Ax) dt 2 

T sin©(x + Ax) — T sin©(x) 

Av 

By forming the limit for Ax, Ay ^ 0 on both sides of (8.4), we obtain 


a 




2 



(sin©). 


(8.3) 


(8.4) 


(8.5) 


For sin© we have sin© = tan©/Vl + tan 2 ©. Since tan© = dy/dx (inclination of 
the curve), we write 


sin© = 


3y/3x 

yi + Oy/a*) 2 ' 


( 8 . 6 ) 


By means of relation (8.6) the equation (8.5) can be transformed as follows: 


J l + (°y) 2 *y = T A (. m Y 
V \ dx J dt dx Vyr+(9y/9x) 2 / 


(8.7) 


In order to simplify the equation, we again consider only small displacements of the 
string in y-direction. Then 3y/3x 1, and (3y/3x) 2 can be neglected too. 

Thus, we obtain 



T — (— 
dx V dx 


( 8 . 8 ) 


or 


d*y 9 2 y 
a dt 2 ~ dx 2 ' 


(8.9) 


We set c 2 — T/a (c has the dimension of a velocity). The desired differential equation 
(also called the wave equation) then reads 


3 2 y 2 d 2 y 
~di 2=c dx 2 



1 9 2 
c 2 dt 2 


y(x,t) = 0. 


or 


( 8 . 10 ) 
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8.1 Solution of the Wave Equation 


The wave equation (8.10) is solved with given definite boundary conditions and initial 
conditions. The boundary conditions state that the string is tightly clamped at both 
ends v = 0 and x = 1 , i.e., 


y(0 ,r)=0 , y(l,t) = 0 (boundary conditions). 

The initial conditions specify the state of the string at the time t — 0 (initial excitation). 
The excitation is performed by a displacement of the form /(x), 

y(x, 0) = f{x) (first initial condition), 


and the velocity of the string is zero, 


d 

dt 


y(x, t) 



(second initial condition). 


For solving the partial differential equation (PDE), we use the product ansatz y(x,t) = 
X(x) • T (t). Such an approach is obvious, since we are looking for eigenvibrations. 
These are defined so that all mass points (i.e., any string element at any position x) 
vibrate with the same frequency. By the ansatz y (x, t) = X(x) • T ( t ), the time behavior 
is decoupled from the spatial one. Thus we try to split the partial differential equation 
into a function of the position X (.x ) and a function of the time T (t). Inserting y (x, t) = 
X(x) • T(t) into the differential equation (8.10) yields 

X(x)T(t)=c 2 X"(x)T(t), 


where d 2 T/dt 2 = T and d 2 X/dx 2 = X". The above equation can be rewritten as 

nt) =c2 y\x) 

T(t ) C X(x) ■ 

Since one side depends only on x and the other side depends on t , while x and t are 
independent of each other, there is only one possible solution: Both sides are constant. 
The constant will be denoted by — co 2 . 


f 

T 


■■ —co 


or T + co 2 T = 0, 


or 


X" 

~X 


or 


x" + ^x = o. 


( 8 . 11 ) 


( 8 . 12 ) 


The solutions of the differential equations (continuous harmonic vibrations) have the 
form 


T(t) = Asin&tf + B cos cot, 
co co 

X(x) = C sm —x + D cos —x 
c c 

The general solution then reads 


co co 

C sm —x + D cos — 
c c 



y(x, t) = (A sin<xtf + B cos cot) 


(8.13) 
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8 The Vibrating String 


The constants A, B,C, and D are determined from the boundary and initial condi¬ 
tions. 

From the boundary conditions, it follows for (8.11) that 
y (0, 0 = 0 = D(A sin cot + B coscot). 

Since the expression in brackets differs from zero, we must have D = 0. Then (8.13) 
simplifies to 


y(x, t) — C sin — x(A sin cot + B coscot), 
c 

With the second boundary condition, we get 

co 

y(l , 0 = 0 = C sin —/(A sincot + B coscot) 
c 

co 

=> 0 = C sin —/. 

c 

This equation will be satisfied if either of the following holds: 


(a) 
or 

(b) 


C — 0, which means that the entire string is not 
displaced, 

smicol/c) = 0. The sine equals zero if ( co/c)l = rut, i.e., 
if co = co n = nice/ /, where n = 1,2, 3,... 
(n = 0 would lead to case (a)). 


From the boundary conditions, we thus obtain the eigenfrequencies co n — nnc 11 
of the string. Since the string is a continuous system, there are infinitely many eigen¬ 
frequencies. The solution for an eigenfrequency, the normal vibration, was marked by 
the index n. Equation (8.11) becomes 


. nn ( . mtc mtc 

y n (x , t) = C • sm — x I A n sin —j—t + B n cos —j—t 


y n (x, t) = sin —x ( a n sin —— t + b n cos 


mtc 




where we set C • A n =a n and C B n = b n . 
From the initial conditions, we have 


: T-yn(x,t) 
ot 


nnc . nn . 

= 0 = —— sm —v a n cos 


t =o 


nnc 


-t — h n sin - 


nnc 


t =o 


Then 


nnc . nn 

a n •-• sm —x = 0 

Z l 

is satisfied for all v only if a n = 0. Thus, the solution of the differential equation is 
nn nnc 

y n (x, t) =b n • sm ~j~ x cos — — t. (8.14) 

The parameter n describes the excitation states of a system, in this case those of the 
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vibrating string. In quantum physics such a discrete parameter n is called a quantum 
number. 

Interjection: If we had selected a negative separation constant in (8.11), i.e., -\-co 2 
instead of — to 2 , we would have arrived at the solution 

y(x, t) = (Ae at + Be~ wt )[ce + De~ 

The boundary conditions y(0, t) = y(l, t) = 0 would have led to the conditions 

C + D = 0; Cec 1 + De c L = 0 


with the solutions C = D = 0. The string would have remained at rest. But this is not 
the desired solution. 

Since the one-dimensional wave equation is a linear differential equation, one can 
obtain the most general solution, according to the superposition principle, by the su¬ 
perposition (addition) of the particular solutions: 

00 

E . mrx mrc 
b n sm —— cos ——t 

i ^ ^ 

n=\ 

The coefficients b n can be calculated from the given initial curve by using the consid¬ 
erations on the Fourier series (see the next chapter): 

00 

E nnx 
b n sin ——. 

n =1 

The calculation of the Fourier coefficients b n will be shown in the next chapter. One 
then gets the following general solution of the differential equation: 


00 

= bn sin k n x cos co n t . 

n =1 


y(x,t) 



mtx 

sin —j— cos 


nnct 

i 


(8.15) 


8.2 Normal Vibrations 


Normal vibrations are described by the following equation: 

y n (x, t) = C n sin (k n x) cos (co n t). (8.16) 


For a fixed time t , the spatial variation (positional dependence) of the normal vibra¬ 
tion depends on the expression sin(nTtx/l) (for n > 1, sm(nTtx/l) has exactly n — 1 
nodes). All mass points (position x) vibrate with the same frequency co n . 

At a definite position x, the time dependence of the normal vibration is represented 
by the expression cos(njtc/l)t. The wave number k n is defined as 


co n nn 2n 
c l kyi 


(8.17) 


where k n = 21 /n is the wavelength. 
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The angular frequency is defined as follows: 


mtc 

(o n = —— = 2nv n . 


(8.18) 


Solving (8.18) for v n , we obtain for the frequency 


nc 


(8.19) 


i.e., the frequencies increase with increasing index n. By definition, 


c = 



( 8 . 20 ) 


c can be interpreted as “sound” velocity in the string, as we shall see below. T is the 
tension in the string, o is the mass density. From (8.19) and (8.20) we find 



( 8 . 21 ) 


i.e., the longer and thicker a string is, the smaller the frequency. The frequency in¬ 
creases with the string tension T. This agrees with our experience that long, thick 
strings sound deeper than short, thin ones. With increasing string tension the frequency 
increases. This property is utilized when tuning up a violin. 

Multiplication of the wavelength by the frequency yields a constant c which has 
the dimension of a velocity: 

21 nc 

X n v n = -= c (dispersion law). (8.22) 

n 21 

c is the velocity (phase velocity) by which the wave propagates in a medium. This can 
be seen as follows: If an initial perturbation y(x, 0) = fix) is given as in Fig. 8.2, 
f{x — ct) is also a solution of the wave equation (8.10), because with z — x — ct we 
have 


9/ = V9z = _ c 9/ ^l = _ c ^l d _l =c 2^l 

dt dz dt C dz ’ dt 2 C dz 2 dt C dz 2 ’ 


—c — 


and 


9/ = V 9 2 / _ 9 2 / 

dx dz ’ dx 2 dz 2 


Hence, 



fix — ct) thus satisfies the wave equation (8.10). 



o 


Fig. 8.2. Propagation of a perturbation fix) along a long string: After the time t, the perturba¬ 
tion has moved away by ct; it is then described by fix — ct) 
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Let the maximum of the perturbation f(x) be at jcq. After the time t, it lies at 


x — ct = XQ. 


It thus propagates with the velocity 
dx 

— — c 
dt 

along the string, namely to the right (positive v-direction). One can say that the per¬ 
turbation f{x) moves along the string with the velocity 


dx 

dt 


= c. 


(8.23) 


The propagation velocity of small perturbations is called the sound velocity. One 
easily realizes as above that f(x + ct) is also a solution of the wave equation and 
represents a perturbation that moves to the left (negative v-direction). We are deal¬ 
ing here with running waves, while for the tightly clamped string we have standing 
waves. 

If a string is excited with an arbitrary normal frequency, there are points on the 
string that remain at rest at any time {nodes). 

The wavelength, the number of nodes, and the shape of normal vibrations can be 
represented as a function of the index n (see Fig. 8.3). 


(a) 


l x 



Fig. 8.3. The lowest normal 
vibrations of a string 


n 

Wavelength 

Number of nodes 

Figure 

1 

21 

0 

(a) 

2 

l 

1 

(b) 

3 

v 

2 

(c) 


2 



n 

-l 

n 

n — 1 
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Fig. 8.4. Displacement and 
deformation (elongation com¬ 
pression) of the string ele¬ 
ment Ax 


EXERCISE 

8.1 Kinetic and Potential Energy of a Vibrating String 


Problem. Consider a string of density a that is stretched between two points and is 
excited with small amplitudes. 

(a) Calculate in general the kinetic and potential energy of the string. 

(b) Calculate the kinetic and potential energy for waves of the form 


y = C cos 


co(x — ct) 


with Tq = 500 N, C = 0.01 m, and A = 0.1 m. 


Solution, (a) The part PQ of the string has the mass a Ax and the velocity dy/dt. 
Its kinetic energy is then 



The total kinetic energy of the string between x = a and b is 



a 


The work which is needed to elongate the string from Ax to A/ is 

A / 

dP = 7q(A/ — Ax),-1. 


(8.24) 


(8.25) 


(8.26) 



For small displacements, we have 


Al = (Ax 2 + Ay 2 ) 1/2 = Ax 



21 */ 2 


Ax 




The potential energy for the region x = a to x = b is then 



a 


(8.27) 


(8.28) 
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For a wave y = F(x — ct) propagating in a direction, we have 
b 


T = P = 


\ t „ / IF' 


(x — ct)] 2 dx, 


c 2 = 


To 


(8.29) 


Hence, the kinetic and potential energy are equal. If a, b are fixed points, then T and P 
vary with time. But if we admit that a and b can propagate with the sound velocity c, 
so that 

a = A + ct and b = B+ct, (8.30) 


then P and T are constant: 

B 


T = P = 


\n / <f' 


(x)) 2 dx. 


(8.31) 


(b) 


3 y . [ co 

— = C sin ( —v — cot ] co 


( d y\ 


=>► ( — ) = C sin —v — cot \co . 

\dt) \c ) 

Insertion into (8.25) yields {a = 0, b = X) 

x 

T = —y C 2 co 2 f sin 2 (—x — dx = —y C 2 co 2 • I. 

2 c 2 J \c J 2 c 2 


(8.32) 


(8.33) 


With the substitution z = ( co/c)x — cot for the integral /, we find 


(. o)/c)X—cot 


(co/c)k 


2 tt 


I = — / sin 2 zdz= — / sin 2 zdz= — / sin 2 z dz 

CO J CO J CO J 

—cot 0 0 

-\2jt 


- C \ l 1 
co \_2 Z 4 


- - sin( 2 z) 


Jo 

2^2 


C 

= —It 
CO 


1 To 9 9 C lt Z C Z To 

T = —y C 2 co 2 -7T =--, 

2 c 2 co X 


X — 2 7T —. 
CO 


(8.34) 


(8.35) 


One gets the same expression for the potential energy. Insertion of the numerical val¬ 
ues yields 

9 9 500 N 9 

T = P = (0.01 ) 2 • it 2 -m 2 - 5 Nm. 

0.1 m 


EXERCISE 

8.2 Three Different Masses Equidistantly Fixed on a String 


Problem. Calculate the eigenfrequencies of the system of three different masses that 
are fixed equidistantly on a stretched string, as is shown in Fig. 8.5. 

Hint : For small amplitudes, the string tension T does not change! 
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Fig. 8.5. 


2m 

o- 


m 

-Q- 


3 m 

O 


Solution. From Fig. 8 . 6 , we extract for the equations of motion 
2m x\ = T 


~(x 2 -X])~ 

— T 

~X\~ 

L 

— I 

L _ 


mx 2 = —T 
3 m 3c 3 = T 


(X2~X\) 


-T 


\x2-x3) 


(8.36) 


" (X2 ~ X 3 ) " 

— T 

" jc 3 " 

L 

— 1 

L _ 


Fig. 8.6. 



We look for the eigenvibrations. All mass points must then vibrate with the same 
frequency. We therefore start with 


3c 1 = —co 1 A sin(cot + t/r), 

3c 2 = —co 2 B sin (cot + \[r), 
. 2 / 


x\ = A sin (cot + ^r), 

X 2 = B sin(cot + ^r), 

X 3 = C sin(m^ + ^), 3 c *3 = —co z C sin(cot + i/r). 

Hence, after insertion into equation (8.36) one gets 

-2 mco 1 I A — I — I B = 0, 


+ (— - 3mcoAc = 0. 


(8.37) 


For evaluating the eigenfrequencies of the system, i.e., for solving equation (8.37), the 
determinant of coefficients must vanish: 



Expansion of the determinant leads to 



0 = 6 m 3 co 6 
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„ „ , , ( —22T m 2 \ , /19 T 2 m\ /-4T 3 \ 

0 _6m Si + (——js 2 

where we substituted Q = co 2 . This leads to the cubic equation 
a£2 2 T b£2 2 T c£2 T d = 0, 


where 


a = 6 m 5 , b = 


-227m 2 


c = 


19T 2 m 

L 2 


J = 


-4T 3 


It can be transformed to the representation (reduction of the cubic equation) 


y 3 + 3 py + 2 q = 0 , 


(8.39) 


where 


and 


b 

:Q + — . 
3 a 


1 b 2 


11 T 

:Q - 

9 Lm 


3p — + - 

3 a z (2 

Insertion leads to 


71 

3 P ~~54L 2 ^ 2 ' 
From this, it follows that 

q 2 + p 3 < 0, 


and 2 q = 


2 b 3 


27 a 3 


1 d 
3 a z a 


2 q = - 


653 r 3 


i.e., there exist 3 real solutions of the cubic equation (8.39). 

For the case q 2 + p 2 < 0, the solutions y\,y 2 , J 3 can be calculated using tabulated 
auxiliary quantities (see Mathematical Supplement 8.4). Direct application of Car¬ 
dano’s formula would lead to complex expressions for the real roots, hence the above 
method is convenient. 

After insertion one obtains for the auxiliary quantities 

COS (p= - , y\ =2V- j pcos-, 

y/-P 5 

y 2 = -2V^cos(| + | 
y 3 = — 2 ^J—p cos 

and finally, for the eigenfrequencies of the system 



o>i 



co 2 = 0.916 




Exercise 8.2 
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EXERCISE 


8.3 Complicated Coupled Vibrational System 


Problem. Determine the eigenfrequencies of the system of three equal masses sus¬ 
pended between springs with the spring constant k, as is shown in Fig 8.7. 

Hint : Consider the solution method of the preceding Exercise 8.2 and Mathematical 
Supplement 8.4. 


Fig. 8.7. Vibrating coupled 
masses 



Solution. From Fig. 8.7, we extract for the equations of motion 

mx i = —kx i — k(x i — X2) — k(x\ — x$), 

mx 2 = —kx 2 — k(x 2 — x\) — k(x 2 — x$), (8.40) 

mx 3 = —kx 3 — k(x 3 — x\) — k(x 3 — X2), 

or 

mx 1 + 3 kx\ — kx 2 — kx 3 = 0 , 

mx 2 + ?>kx2 — kx 3 — kx\ = 0, (8.41) 

mx 3 + 3^x3 — — kx2 = 0 . 

We look for the eigenvibrations. All mass points must vibrate with the same fre¬ 
quency. Thus, we adopt the ansatz 

x\ = A cos (cot + \[r), x\ = —co 2 A cos(cot + x[r), 

X2 = B cos(cot + \jf), X2 = —C0 2 B cos (cot + Vr), 

X3 = C cos(o>^ + l/r), ic‘3 = —C0 2 C cos(cot + l/r), 

and after insertion into (8.41), we get 

( 3 k - mm 2 ) A -kB-kC = 0 , 

-kA + ( 3 k - mcu 2 )£ - kC = 0 , 

—kA — k£ + ( 3 k — mco 2 )C = 0 . 


(8.42) 
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To get a nontrivial solution of (8.42), the determinant of coefficients must vanish: 



(3 k — moo 2 ) —k 

—k 


—k (3k — mco 2 ) 

—k 


—k —k 

(3k — mco 2 ) 

Expansion of the determinant leads to 


« 9k a 24 k 2 , 

16£ 3 

U = 07 - 07 -I- Z-07 — 



m 

m J 

or 


, 9k „ 24 k 2 

16& 3 

0 — £2 £2 + ~ £2 

’ 


“ u ' o Q 

m m z 


where we substituted Q = co 2 (see Exercise 8.2). The general cubic equation aQ 3 + 
bQ 2 + c £2 + d = 0 (in our case a = 1 , b = —9/c/m, c = 24 k 2 /m 2 , d = —16 k 3 /m 3 ) 
can according to Mathematical Supplement 8.4 be reduced to 

y 3 + 3 py + 2q = 0, 


where 

^ b 

y —, 

3a 

Insertion leads to 
k 2 

3/? =-3 — , 
m z 


1 Z ? 2 c 
3/7 = --^ + -, 
3a z a 


„ 2 Z ? 3 1 Zpc J 

27 3 or a 


k 3 

2 q =2 — 
m J 


^ 2 +A 


: 0 , 


i.e., there exist 3 solutions (the real roots); 2 of them coincide. Hence, the vibrating 
system being treated here is degenerate. As in Exercise 8.2, the solutions can be cal¬ 
culated using tabulated auxiliary quantities. For these, we obtain 


COS (p = 


7 = 


_ (p 

yi =2V-/7COS-, 


(M> 


y 2 = — 2 «J—p cos 

3, 3 = -2V^cos(^|-| 
and, after insertion, for the eigenfrequencies of the system 

nr nr 

0)1 = C02 = 2 J —. 
m V m 


07 3 = 


MATHEMATICAL SUPPLEMENT 
8.4 The Cardano Formula 1 

In theoretical physics, one often meets the problem of solving a cubic equation, just 
as in the Exercises 8.2 and 8.3. We now will clarify this problem. 


1 We follow the exposition of E. v. Hanxleben and R. Hentze, Lehrbuch der Mathematik, Friedrich 

Vieweg & Sohn 1952, Braunschweig-Berlin-Stuttgart. 
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Mathematical Supplement 8.4 Reduction of the general cubic equation: If the general cubic equation 


x 3 4" ax 2 bx + c — 0 


(8.43) 


with nonvanishing coefficients a, b, and c is to be solved, one must first eliminate the 
quadratic term of the equation, i.e., reduce the equation. If the unknown x is replaced 
by y + A, where y and A are new, unknown quantities, (8.43) turns into 

(y 3 + 3y 2 A 3yk 2 A. 3 ) 4- (ay 2 ~\~ 2ayk ak 2 ) 4- (by H- bX) -\~ c = 0, 

y 3 + (3A + a)y 2 + (3k 2 + 2ak 4- b)y + (A 3 + ak 2 + bk + c) = 0. (8.44) 

Since we have replaced one unknown quantity x by two unknown ones, y and A, we 
can freely dispose of one of the two unknown quantities. This freedom is exploited so 
as to let the quadratic term of the equation disappear. This is achieved by setting the 
coefficient of y 2 , that is, 3A + a , equal to zero, i.e., A = —a 13. By inserting this value 
(8.44) changes to 



(8.45) 


If we set the expressions determined by the known coefficients a, b, and c of the cubic 
equation, 


,2 


2 a 3 ab 

-\-b = p and-1 - c = q, 

1 27 3 * 


a 


(8.46) 


3 


the cubic equation takes the form 

y 3 + py + q = 0 (reduced cubic equation). 


(8.47) 


Result: To reduce the cubic equation given in the normal form, one sets x = y —a/3. 
Then (8.47) follows from (8.43). 

Example: X 3 - 9x 2 + 33jc - 65 = 0. 

(1) Solution : Set x = y — (—3) = y + 3. 


(y + 3) 3 - 9 (y + 3) 2 + 33 (y + 3) - 65 = 0, 
(; y 3 + 9y 2 + 21y + 27) - 9(y 2 + 6y + 9) + 33(y + 3) - 65 = 0, 

y 3 + 6y-20 = 0. 


(2) Solution : Insert the values calculated from (8.46) into (8.47). 

Special case: If in the general cubic equation, the linear term is missing (b = 0), 
i.e., the cubic equation is given in the form 


x 3 + ax 2 + c = 0, 


(8.48) 


the reduction can also be performed by inserting 
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From (8.48) and (8.49), we obtain the reduced equation 

c 3 c 2 

— + c = 0 or y 3 + acy + c 2 = 0. (8.50) 

r y 1 

Solution of the reduced cubic equation: If one sets in the reduced cubic equation 

y 3 + py + q = o, 

(8.51) 

y = u + v, 

one obtains 

u 3 + 3u 2 v + 3 uv 2 + t; 3 + p(u + v) + q = 0, 

( u 3 + i; 3 + g) + 3uv{u + u) + p(u + u) = 0, 

(w 3 + u 3 + g) + (3 uv + /?)(w + i;) = 0. (8.52) 


Since one can freely dispose of one of the unknown quantities a or u (justifica¬ 
tion?), these are suitably chosen so that the coefficient of (u + v) vanishes. We there¬ 
fore set 


3uv + p = 0, i.e., uv = — —. 


(8.53) 


Equation (8.52) simplifies to 

u 3 v 3 -|- ^ = 0 or u 3 -f i > 3 = — q. 


(8.54) 


u and v are determined by (8.53) and (8.54). The quantities u and v can no longer be 
arbitrarily chosen. By raising (8.54) to the second power and (8.53) to the third power, 
one obtains 


u 6 + 2u 3 v 3 + v 6 = q 2 , 


4 u 3 v 3 = -4 


(!)* 


Subtraction of the two equations yields 
(u 3 -v 3 ) 2 = q 2 + 4 ff 3 


v 3 = ±Jq 2 + 4 Hr 


(8.55) 


By addition and subtraction of equations (8.54) and (8.55), one obtains 
* 1 


U 2 


-q±Jq 2 + 4(| 


, 1 
and v = - 


<lTjq 2 + 4 (| 


U = 3 

N 


+ - 


(i) + (fi 


and v = 3 

N 


'( 1)4 


(8.56) 


Mathematical Supplement 8.4 
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Mathematical Supplement 8.4 


If one sets 



one gets 

U\ = m, U 2 = m€ 2 , U 3 = M 63 , 

v\ = n , V 2 = nc 2 , V 2 =n€ 2 ,. 

Here, the 6 ; are the unit roots of the cubic equation x 3 = 1 which, as is evident, read 

1 V3 1 73 

S| = 1 , *i = -- + i —, « = • 

Since now y = w + u, one can actually form 9 values for y (why?). But since the 
quantities w and t; must satisfy the determining equation (8.53), the number of possible 
connections between u and v is restricted to 3, namely, 

yi=Ui + V\, yi = U 2 + V 2 , y?> = U2 + V2\ 

hence, 


m + n m — n r 
y 2 =me 2 + nc3 =---1- —*V3, 

m + n m — n r 
y 3 =m€3+n € 2 =---—*V3. 

The real root of the cubic equation, i.e., the root 



(8.57) 



is known as the “Cardano formula.” It was named in honor of the Italian Hieronimo 
Cardano 2 to whom the discovery of the formula was falsely ascribed. Actually, the 


2 Hieronimo Cardano, Italian physicist, mathematician, and astrologer, b. Sept. 24, 1501, Pavia- 
d. Sept. 20, 1576, Rome. Cardano was the illegitimate son of Fazio (Bonifacius) Cardano, a friend 
of Leonardo da Vinci. He studied at the universities of Pavia and Padua, and in 1526 he graduated in 
medicine. In 1532, he went to Milan, where he lived in deep poverty, until he got a position teaching 
in mathematics. In 1539, he worked at a high school of physics, where he soon became the director. 
In 1543, he accepted a professorship for medicine in Pavia. 

As a mathematician, Cardano was the most prominent personality of his age. In 1539, he published 
two books on arithmetic methods. At this time, the discovery of a solution method for the cubic 
equation became known. Nicolo Tartaglia, a Venetian mathematician, was the owner. Cardano tried in 
vain to get permission to publish it. Tartaglia left the method to him under the condition that he keeps 
it secret. In 1545, Cardano’s book Artis magnae sive de regulis algebraicis, one of the cornerstones of 
the history of algebra, was published. The book contained, besides many other new facts, the method 
of solving cubic equations. The publication caused a serious controversy with Tartaglia. 
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formula is due to the Bolognesian professor of mathematics Scipione del Ferro, 3 who 
found this ingenious algorithm. 

Example: y 3 — 15y — 126 = 0. Here, 


p = —15, q = —126, 



By inserting into the Cardano formula, one obtains 

Vi = 763 + V632^P + 763- 

= 763 + 73844 + 763 - 73844c 

= 763 + 62 + 763 - 62 

= 7l25 + 7l (=m+n) 

= 6 , 

yi = — ^yl + ~ 2 1 *^3 = -3 + 2/73, 

- ^2/V3 = -3 - 2/73. 

Check the validity of the roots by insertion! 

Discussion of Cardano’s formula: The square root appearing in the Cardano for¬ 
mula only yields a real value if the radicand (q/2) 2 + (p/ 3) 3 > 0. If the radicand 
is negative, the three values for y yield complex numbers. We consider the possible 
cases: 


( 1 ) 

( 2 ) 


(a) 

(b) 

(c) 


p > 0 

p < 0, namely, 




Form of the roots 

Real 

A real value, two complex con¬ 
jugate values 

Real 

As in (1). 

= 0 

Three real values, among them 
a double root 

Imaginary 

All three roots by the form 
imaginary 


The case (2c) was of particular interest to the mathematicians of the Middle Ages. 
Since any cubic equation has at least one real root, but they could not find it by means 
of Cardano’s formula, the case was called the casus irreducibilis . 4 The first to solve 


3 Scipione del Ferro , b. 1465(7)—d. 1526 (?). About his life we know only that he lectured from 1496 
to 1526 at the university of Bologna. By 1500, he discovered the method of solving the cubic equation 
but did not publish it. Tartaglia rediscovered the method in 1535. 

4 Casus irreducibilis (Lat.) = “the nonreducible case”. 
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this case was the French politician and mathematician Vieta. 5 He proved by using 
trigonometry that this case was solvable too, and that in this case the equation has 
three real roots. 

Trigonometric solution of the irreducible case: Since p is negative in this case, 
one starts from the reduced cubic equation 

y 3 - py + q = 0, (8.58) 

where p must now be kept fixed as absolute numerical value. According to the trigono¬ 
metric formulae we have 


cos 3a = cos(2a + a) = cos 2a cos a — sin 2a sin a 
= (cos 2 a — sin 2 a) cos a — 2 sin 2 a cos a 
= cos 3 a — sin 2 a cos a — 2 sin 2 a cos a 
= cos 3 a — (l — cos 2 a) cos a — 2(1 — cos 2 a) cos a 
= cos 3 a — cos a + cos 3 a — 2 cos a + 2 cos 3 a 
= 4 cos 3 a — 3 cos a, 


thus, 


o 3 1 

cos a -cos a -cos 3a = 0. 

4 4 


(8.59) 


If one considers cos a to be unknown, (8.59) coincides with the form of (8.58). But 
since the value of the cosine varies only between the limits —1 and +1, while y, ac¬ 
cording to the values of p and q , can take any values, one cannot simply set cos a = y. 
By multiplying (8.59) by a still uncertain positive factor q 3 , one obtains 


q 3 cos 3 a — -q 2 • £ cos a — -£ 3 cos 3a = 0. 


(8.60) 


By setting q • cosa = y, p = (3/4 )q 2 , and q = — (l/4)£ 3 cos3a, (8.60) turns into 
(8.58). From this, we find 


4 = 2 - 



(8.61) 


and 

4 q -4 q q/2 

cos 3a =- y =-; - =- , . 

e 3 8 • (p/3)^fp]3 7^73)3 

Equation (8.62) is ambiguous, since the cosine is a periodic function. One has 


3a = + k • 360°, where k = 0, 1,2, 3,_ 


(8.63) 


5 Frangois Vieta, French mathematician, b. 1540, Fontenay-le-Comte-d. Dec. 13, 1603, Paris. Ad¬ 
vocate and adviser of Parliament in the Bretagne. His greatest achievements were in the theory of 
equations and algebra, where he introduced and systematically used letter notations. He established 
the rules for the rectangular spherical triangle which are often ascribed to Neper. In his Canon math- 
ematicus, a table of angular functions (1571), he emphasized the advantages of decimal notation. 
[BR] 
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From this, we find for a 


<P 

0i\ = —, 

3 


c* 2 = | + 120 o , 


a 3 = |+240 o . 


Compare this consideration with the problem of cyclotomy! Which values are ob¬ 
tained for a if k = 3, 4,...? 

For y, one obtains 



so that the roots of the cubic equations are 



(8.64) 


Comment: The formulas of the casus irreducibilis can also be derived by means of 
the Moivre’s theorem. 

Example: Calculate the roots of the equation 
y 3 -981^- 11340 = 0. 


Solution: Since p < 0 and 



( 



2 


= 5670 2 , 



3-log 327 = 7.5436, 
2-log 5670 = 7.5072, 
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by comparing the logarithms it follows that |(/?/3) 3 | > (q/ 2) 2 . Thus, the condition of 
the casus irreducibilis is fulfilled. According to (8.62) 

5670 

cos 3a = H— . , 

V3273 

log cos 3a = 3.7536 - 3.7718 = 9.9818 - 10, 

^ = 3a«16°30 / , hence, ^=a = 5°30'. 

3 

From (8.64), we obtain y\ = 36, y 2 = —21, y 3 = —15. Check the root values by in¬ 
sertion! 






Fourier Series 



When setting the initial conditions for the problem of the vibrating string, a trigono¬ 
metric series was set equal to a given function fix). The expansion coefficients of 
the series had to be determined. To solve the problem, the function fix) should also 
be represented by a trigonometric series. These trigonometric series are called Fourier 
series. 1 The conditions that allow an expansion of a function into a Fourier series are 
summarized as follows: 

(1) fix) is defined in the interval a < x < a + 21, 

(2) f{x) and f\x) are piecewise continuous on a < x < a + 21, 

(3) f(x) has a finite number of discontinuities which are finite jump discontinuities, 
and 

(4) fix) has the period 21, i.e., fix + 21) = fix). 

These conditions (Dirichlet conditions) are sufficient to represent fix) by a Fourier 
series: 


fix) = 



+ b n sin 


YITCX \ 

~T~ )■ 


(9.1) 


1 Jean Baptiste Joseph Fourier, b. March 21, 1768, Auxerre, son of a tailor-d. May 16, 1830, Paris. 
Fourier attended the home Ecole Militaire. Because of his origin he was excluded from an officer’s 
career. Fourier decided to join the clergy, but did not take a vow because of the outbreak of the rev¬ 
olution of 1789. Fourier first took a teaching position in Auxerre. Soon he turned to politics and 
was arrested several times. In 1795, he was sent to Paris to study at the Ecole Normale. He soon 
became member of the teaching staff of the newly founded Ecole Poly technique. In 1798, he be¬ 
came director of the Institut d’Egypte in Cairo. Only in 1801 did he return to Paris, where he was 
appointed by Napoleon as a prefect of the departement Isere. During his term of office from 1802 
to 1815, he arranged the drainage of the malaria-infested marshes of Bourgoin. After the downfall 
of Napoleon, Fourier was dismissed from all posts by the Bourbons. However, in 1817 the king had 
to agree to Fourier’s election to the Academy of Sciences, where he became permanent secretary 
in 1822. Fourier’s most important mathematical achievement was his treatment of the notion of the 
function. The problem of the vibrating string that had been treated already by D’Alembert, Euler, and 
Lagrange, and had been solved in 1755 by D. Bernoulli by a trigonometric series. The subsequent 
question of whether an “arbitrary” function can be represented by such a series was answered 1807/12 
by Fourier in the affirmative. The question about the conditions for such a representation could be 
answered only by his friend Dirichlet. Fourier became known mainly by his Theorie analytique de la 
chaleur (1822) which deals mainly with the discussion of the equation of heat propagation in terms of 
Fourier-series. This work represents the starting point for treating partial differential equations with 
boundary conditions by means of trigonometric series. Fourier also made import contributions to the 
theory of solving equations and to the probability calculus. 
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9 Fourier Series 


The Fourier coefficients a n ,b n , and ao are determined as follows: 

&-\-2l 


bn 


ao : 


7 / 


mex 

fix) cos —— dx , 


a 

a+2l 


7 / 


rutx 

fix) sin —— dx, 


(9.2) 


a 

a+2l 


\f 


fix) dx. 


To prove these formulas, one needs the so-called orthogonality relations of the trigono¬ 
metric functions: 


21 


I 


me x rmex 

cos-cos- dx = l 8 nm , 


u 

/ si 


me x rmex 

sin — T — sin- dx = / 8 nm , 


o 

2 / 

/ si 


177TX m7TX 

sin-cos-dx = 0. 

/ / 


(9.3) 


The first relation can be proven by means of the theorem 


cos A cos B = ^ (cos(A + 5) + cos(A — 5)), 


2 / 


/ 


me x rmex 

cos-cos-dx = 

/ / 2 

o o 


2 / 

£/(■ 


(ft+m)7rx (n — m) iex 

cos-b cos- 


dx = 0, 


if it ^ m. The integral of the cosine function over a full period vanishes. For n = m 
we have 


21 

/< 


me x meex 


cos —— cos —— 


21 

dx = - / ( 1 + cos ■ 


-\i( 


2mex 

T~ 


dx = l. 


The other relations can be proved in an analogous way. 

Formula (9.2) for calculating the Fourier coefficients can be proved by means of 
the orthogonality relations. 

To determine the a n , one multiplies the equation 


ao \—> riJL a ^> 

f (x) = — + ^Cl n C°S -j- + 2_ bn S1 


nee x 


me x 


sm 


n =1 


n =1 
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by cos(m7tx/l ) and then integrates over the interval 0 to 2/: 


21 


21 


21 


/ m7tx 
f{x) cos —-— ax 


ao ( 

2 J 


m 7TX 
cos —-— ax + 


J2 a n f 

n =1 n 


/t7TV m7TX 

cos —— cos —-— ax 


21 


+!>/ 

n =1 n 


m7rv 

sin —-— cos —-— av 


— ^^a n l8 nm — la m , 


and therefore, 

2/ 

am = j) 

0 


n=\ 


mnx 

fix) cos —-— ax, 


(9.4) 


as is given by (9.2). 

The analogous relation for the b m can be confirmed by multiplication of (9.1) by 
sinimTtx/l) and integration from 0 to 2/; the same holds for the calculation of ao. 
Functions that satisfy 

fix) = fi-x) 

are called even functions', functions with the property 
fix) = -fi-x) 


are called odd functions . For instance, fix) = cos v evidently is an even function and 
f ix) = sin jc an odd function. The part of (9.1) 

00 


ao 

~2 


+E 


a n cos 


n =1 


nnx 

~T~ 


is obviously even, while 

00 

E rntx 

b n sin —— 

n =1 


represents the odd part of the series expansion (9.1). Therefore, for even functions all 
b n = 0, for odd functions ao and all a n are equal to zero. 

Any function fix) can be decomposed into an even and an odd part. Thus, if(x) + 
fi—x))/ 2 is the even part and ifix) — /(— x))/2 the odd part of fix) = lifix) + 
fi-x))/2 + (f(x)-f(-x))/2l. 


EXAMPLE 


9.1 Inclusion of the Initial Conditions for the Vibrating String by Means of the 
Fourier Expansion 

A string is fixed at both ends. The center is displaced from the equilibrium position by 
the distance H and then released. From Fig. 9.1 we see that the initial displacement is 
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Fig. 9.1. 



given by 


y(x,0) = f(x) = 


Hx 

2_ T’ 

2 H{l-x) 

7 ’ 


0<x<-, 


- <x <1. 
2 ~ ~ 


If we assume f(x) is an odd function (dashed line), we then obtain 


/ 


/ . 117TX 

f (x) sin —— dx 

o 


1/2 

(P 


2Hx . mtx 


l 


-sin- 

/ / 


. 2 H mtx ' 

dx + / -(/ — x) sin- dx 

l l i 


/ 

1/2 


1/2 

/- 


2 Hx rurx 2H 

— sin- dx = - 

/ / l 


l mtx l 2 

—x — cos-1- 

mt l 


2-rr2 Sm / 


n jc 


~\l/2 


JO 


21H mr HI mr 

“2 2 Sm - cos ^- ; 

n z 7r z 2 nit 2 


l 


f 

1/2 


2H . mtx 

-j- (/ — x) sm —— dx 


2 H 

~T 

2 H 

~T 


/ 1 1 \ 

f [ mtx C mtx 

/ / sm- dx — x sm-dx 

J l J l > 

x //2 1/2 ' 


l 1 mtx xl mtx l 2 . mtx 

-cos —-1-cos —- -z -z sm —— 

mt l mt l mit z / 


/ 

1/2 


21H . mt IH mt 
sm-1-cos —, 


bn — , 


2/2 IH . mt 21H 


r sin + 

/ \n 2 7t 2 2 n^it 


. mt 

2^2 Sm T" 


8 // 


. ft7T 

2^2 Sm ^“- 


n it 


By inserting the solution for the Fourier coefficient b n into the general solution of 
the differential equation (8.15), we get the equation that describes the vibrations of 
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a string: 


Example 9.1 


y(x,t) = ^2 


n =1 


8 H . nn 
n z n z 2 


. nnx nnct 
sin —-— cos —-— 


8 H 

7T 2 


i . 7 XX 

j2 sm ~y~ cos 


net 

~T 


3nx 


_ 3^ Sm 


■ COS 


+ 52™ 


5 nx 5nct 

~r cos— T 


3nct 

~T~ 


Thus, by plucking the string in the center one essentially excites the fundamental 
mode (lowest eigenvibration) sin(nx/1) cos(nct/1). Several overtones are admixed 
with small amplitude. The initial displacement obviously corresponds to the funda¬ 
mental vibration. If one wants to excite pure overtones, the initial displacement must 
be selected according to the desired higher harmonic vibration (compare Fig. 8.3). 


EXERCISE 

9.2 Fourier Series of the Sawtooth Function 

Problem. Find the Fourier series of the function 

f(x ) = 4x, 0 <x < 10, with period 21 = 10, l = 5. 


Solution. The Fourier coefficients are 


10 

ao = - j 4 xdx = -x 


if 

0 

1C 

1 f nnx 4x 

' n = - I 4x cos —— dx = — 
5 J 5 nn 


4x nnx 
cos ——- 


10 

-±fs 

nn J 


nnx 

sm ——- dx 


:0 + 


20 


nnx 


- cos 


10 


= 0 , 


10 10 10 

4 f nnx 4x nnx 4 f 

b n = - xsm - dx = -cos- H-/ c 

5 J 5 nn 5 0 nn J 


nnx 

cos- dx 


40 20 




nnx 


- sm ■ 


10 


40 

nn 


Hence, the Fourier series reads 


40 1 nnx 

fix) = 20-Y] - Sin ——. 

n £ ' n 5 

n =1 
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The first partial sums S n of this series are drawn in Fig. 9.2. A comparison of this 
series with the starting curve f(x) illustrates the convergence of this Fourier series. 


EXERCISE 


9.3 Vibrating String with a Given Velocity Distribution 


Problem. Find the transverse displacement of a vibrating string of length l with 
fixed endpoints if the string is initially in its rest position and has a velocity distribu¬ 
tion g(x). 

Solution. We look for the solution of the boundary value problem 


d 2 y 2 d 2 y 
dt 2 dx 2 ’ 


(9.5) 


where y = y(x, t), with 


y(0, t) = 0, y(l,0 = 0, 

y(x, 0) = 0, ^-y(x, t) 

ot 

We use the separation ansatz y = X(x) • T (t). By inserting it into (9.5), one obtains 

X-T = c 2 X"T or ?L( x ) = S-( t ). (9.7) 

Since the left-hand side of (9.7) depends only on x, the right side only on t , and v and 
t are independent of each other, the equation is satisfied only then if both sides are 
constant. The constant is denoted by —A 2 . 

A" 9 f 9 

— = —X 2 and — = —X 2 , 

X c 2 T 

or, transformed, 

X" + \ 2 X=0 and f + X 2 c 2 T= 0. (9.8) 


= g(x). 


t =0 


(9.6) 


The two equations have the solutions 


X = A\ coslv + B\ sinAv, 


T = A 2 cos kct + B 2 sin Xct . 
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Since y = X • T, we have 

y (v, r) = (Ai cos Xx + B\ sin Xx) (A 2 cos Xct + B 2 sin Xct ). (9.9) 

From the condition y(0, t) = 0, it follows that A\(A 2 cos Xct + .62 sinA.cf) = 0. This 
condition is satisfied by A\ = 0. Then 

y(x , t ) = B\ sin Xx ( A 2 cos + B 2 sin . 

We now set 


B\A2 = cl, B\B2 = b , 
and it follows that 

y(x, t) = sin Xx(a cos Xct + bsmXct). (9.10) 

From the condition y(l, t) = 0, it follows that sin XI = 0. This happens if 

XI = nit or X = —j~. (9.11) 

Here, n = 1,2,3,.... The value n — 0 which seems possible at first sight leads to 
y(x,t ) = 0 and must be excluded. The relation (9.11) is inserted into (9.10). The 
normal vibration will be labeled by the index n : 


y n (x,t) = sin- 


mtx 


mrct . n7Tct\ 

a n cos —-- 1 - b n sm —-— I. 


(9.12) 


Because y(x, 0) = 0, all a n = 0, we have 

. nicx . mrct 
y n (x , t) = b n sm —— sm —-—. 

By differentiation of (9.13), we get 


(9.13) 


dyn 

~dt 


nnc . nitx rurct 
- b n —— sm cos - 


/ 


/ 


/ 


(9.14) 


For linear differential equations, the superposition principle holds, so that the entire 
solution looks as follows: 


3 y rutcb n nnx nixct 

— = > -sm-cos-. 

m ^ 1 1 1 


n =1 


3 1 

Because 

3 

—y(Ar, r) 
Ot 

it follows that 


(9.15) 


= g(x). 


t=0 


00 J 

E rutcbn nnx 
—— sm—. 


(9.16) 


n =1 


Exercise 9.3 
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Exercise 9.3 


The Fourier coefficients then follow by 


nncbn 2 f mrx 

I g(x ) sin —-— ax 


l l 


(9.17) 


or 


l 

2 f mrx 

b n — - / g(x) sm —-— ax. 

mrc J 

o 


/ 


By inserting (9.18) into (9.13), we obtain the final solution for y(x,t): 

_ / / x 

/ • n 
p(x 1 sin — 


E ( 2 f . mrx ,\ . mrx . mrct 
I - / g(x ) sin — T — ax I sm sm 


n =1 


/ l 


0 


(9.18) 


(9.19) 


EXERCISE 


9.4 Fourier Series for a Step Function 


Problem. Given the function 

0, for —5 < x < 0, 


/(*) = 


3, for 0 < x < 5 


period 2/ = 10. 


(a) Sketch the function. 

(b) Determine its Fourier series. 


Solution, (a) 

0, for — 5 < x < 0, 


fix). 


3, for 0 < x < 5 


period 21 = 10. 


Ax) 




'X 




i i i i J 


i i 

i i 

i i 

i i 


i i i i 

-15 -10 -5 

5 

10 15 

X 


Fig. 9.3. 
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(b) For period 21 = 10 and / = 5, we choose the interval a to a + 21 to be —5 to 5, 
i.e., a = —5: 

<2+2/ 5 

^ . mrx 1 f mrx 

a n = — I f{x) cos —-— dx = - / fix) cos —-— dx 


Exercise 9.4 


if 

a 

r 0 

it 


i 


i 


-5 


0 5.5 

mrx f mrx 3 f mrx 

, . (0) cos- dx + / 3 cos- dx > = - / cos-dv 

5 J 5 J 5 5J 5 

—5 0 J 0 


3 f 5 mrx 

- \ — sin- 

5 \ mr 5 


= 0 for n ^ 0. 


For /7 = 0, one has a n = clq = (3/5) / Q 5 cos(07iv/5) dx = (3/5) / Q 5 dx = 3. 
Furthermore, 

<2+2/ 


* . mrx 1 f mrx 

b n = - I f(x) sin ——dx = - f(x) sin —-—Jv 

<2 -5 


7/ 


iff f mrx ] 3 f nnx 

-{ / (0) sm- dx -\- I 3 sin-c/v > = - / sm- 

51/ 5 J 5 5; 5 

—5 0 J 0 

3/5 

= -1-cos 

5 \ /17T 




Thus, 


= —(1 — cos mr ). 


( mrx \ 
— (1 — cos«7r) sinl —- 
mr \ 5 J 


n =1 


i.e., 


3 6 / 7rv 1 2irx 1 57 Tjc 

/ W = + H-( sm — + - sin + - sm + ■ 

27r\53 5 5 5 


EXERCISE 


9.5 On the Unambiguousness of the Tautochrone Problem 


Problem. Which trajectory of the mass of a mathematical pendulum yields a pen¬ 
dulum period that is independent of the amplitude? 

Solution. We consider Fig. 9.4. From energy conservation, we have 

m .9 

—siy) + gmy = mgh (9.20) 
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Exercise 9.5 


or 


s(y) = y/2g(h-y). 

From this, one can calculate the period by separation of the variables: 

T/4 s(h ) 

T = 

4 


i /*+ n 

-T= j dt = J ~J= _ = = f 
0 0 0 


(ds/dy)dy 


Using the variable u = y/h , (9.22) changes to 
l 


W 


(ds/dy)Vhdu 
y/2g(\-u) 


(9.21) 


(9.22) 


(9.23) 


We now require that T be independent of the maximum height h : 
dT 

— =0 for all h. 
dh 

Thus, we get from (9.23) (s' = ds/dy) 
l _ l 


/ s'\l~hdu [ du (1 1/9/ /—ds'\ 

- / (-/T^V + Vft— =0 

V2g(l m) J y/2g(l — u) \2 J/z / 

0 


(9.24) 


= 0 for all A. (9.25) 


With the condition that we keep the dimensionless variable u = y/h constant, we can 
rewrite the derivative with respect to h as a derivative with respect to y. 


ds uds ds .. 

— =-= u — = us , 

dh d(uh) dy 

and thus, we can transform (9.25) into 

l 


(9.26) 


/ 

o 


— (s' + 2ys") —— = 0 for all h. 
y/%g(\-uY \/h 


(9.27) 


Any periodic function f(u) satisfying f(u) du = 0 can generally be expanded into 
a Fourier series: 


f(u) = [a m sin(2nmu) + Z? m cos(2jrmw)]. 


m =1 


Therefore, from (9.27) it follows that 
1 t \J%gh( 1 — u) 


s + — j' = 

2y 


2y 

y/%gh(h-y) 
2y 


Y, ( a ™ sin(27rmw) + Z? m cos(27rmw)) 


m=l 

oo 


m =1 


X] ( sin( 2nmj^ J + b m cos| 2nm 


(9.28) 


(9.29) 
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This holds for all values of h. The left-hand side of (9.29) does not contain h\ therefore, 
the right-hand side must be independent of h too. This holds only for a m = b m = 0 
(for all m), as we shall prove now. 

To have the right-hand side of (9.29) independent of h, we must have 


E 

m=l 




“b b m COS 



constant • (y/h)h^^ 2 
V8*(l -y/h) ~ 


(9.30) 


or 


00 

sin(27rmw) + b m cos(27rmw)] 

m=l 


U h V 2 c 

Vl — U x/Sg 


By integrating (9.31) from 0 to 1, we obtain 
l 


h 1 ' 2 f u 
0= —=C / 7 

j Vi — u 


Ah 1 / 2 

; dll =- 1=C, 


(9.31) 


(9.32) 


thus, C = 0. (This reflects the fact that w/V 1 — w cannot be expanded into a Fourier 
series d /a (9.31).) 

Inserting this result C = 0 again into (9.30), we have a m = b m = OVm, and thus, 
from (9.29) 

s" + — = 0. (9.33) 

2y 

From this, one finds by integrating once 

- = -T =► s ' = — = c t -<MZ)\ny _ (9.34) 

s' 2 y dy Jy 

The constant is usually denoted by 



so that we have to solve 


(9.35) 


ds _ IT l 
dy v2 vy’ 

This is the differential equation of a cycloid. 2 


(9.36) 


Exercise 9.5 


2 See W. Greiner: Classical Mechanics: Point Particles and Relativity, 1st ed., Springer, Berlin 
(2004), Problem 24.4. 
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10 


We consider a two-dimensional system: the vibrating membrane. We shall see that the 
methods applied for the treatment of a vibrating string can be simply transferred in 
many respects. 

The membrane is a skin without an elasticity of its own. The stretching of the 
membrane along the edge leads to a tension force that acts as a backdriving force on a 
deformed membrane. 

Let the tangential tension in the membrane be spatially constant and time indepen¬ 
dent. We consider only vibrations with amplitudes so small that displacements within 
the membrane plane can be neglected. 


10.1 Derivation of the Differential Equation 

We introduce the following notations: a is the surface density of the membrane, and 
the membrane tension is T (force per unit length). Let the coordinate system be ori¬ 
ented so that the membrane lies in the x,y-plane. The displacements perpendicular to 
this plane are denoted by u = u(x, y, t). 

To set up the equation of motion, we imagine a cut of length Ax through the mem¬ 
brane parallel to the x-axis, and a cut Ay parallel to the y-axis. The force acting on 
the membrane element Ax Ay in the x-direction is the product of the tension and the 
length of the cut: F X = T Ay. Analogously for the y-component we have F y — T Ax. 

The surface element Ax Ay is pulled by the sum of the two forces. If the membrane 
is displaced, the u -component of this sum acts on it. 

From Fig. 10.1, we see 

F U = T Ax(sin<p(y + Ay) — sin^(y)) + T Ay(sin?7(x + Ax) — sin#(x)). (10.1) 

Since we restrict ourselves to small amplitudes and angles, the sine can be replaced 
by the tangent. For the tangent we then insert the differential quotient, e.g., 

3 u 

tan <p(x, y + Ay) = — (x, y + Ay), 
dy 

i.e., the partial derivative with respect to y at the point y + Ay. 

Equation (10.1) then takes the form 

(d u 3 u \ /3 u 3 u \ 

F U = T Axl —(x,y + Ay) - — (x,y) ) + T Ay( —(x + Ax) - — (x,y)\. 


W. Greiner, Classical Mechanics , 

DOI 10.1007/978-3-642-03434-3_10, © Springer-Verlag Berlin Heidelberg 2010 


133 





134 


10 The Vibrating Membrane 


Fig. 10.1. The vibrating mem¬ 
brane seen in perspective (a), 
various cuts through the mem¬ 
brane (b), and from the top (c) 



(C) 

y 

y + Ay 

y 



X 


Moving the product T Ax Ay to the left side, one has 

u y \ Ay Ax 

We replace the area Ax Ay of the membrane element by A m/cr, where A m is its 
mass, and a = Am/ Ax Ay is the mass density per unit surface. Turning now to the 
differentials, Ax, Ay —> 0, we find 


lim 

Ax^O 


d j(x + Ax 1 y) 
Ax 




d 2 u 
3x 2 


(*,30 


Am (d 2 u 3 2 u\ 

F ' = T —{a* + &} 

With this force, we arrive at the equation of motion 

3 2 u Am ( 3 2 u 3 2 u 

m 9f2 =T ^~\d^ + 


With the abbreviation T/a =c 2 and the Laplace operator, one obtains 
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This form of the wave equation is independent of the dimension of the vibrating 
medium. If we insert the three-dimensional Laplace operator and set u = u(x, y, z, t), 
(10.2) also holds for sound vibrations (u then represents the density variation of the 
air), c is the propagation velocity of small perturbations (velocity of sound)—similar 
to the case of the vibrating string. 


10.2 Solution of the Differential Equation: Rectangular Membrane 

We will now solve the two-dimensional wave equation (10.2) for the example of the 
rectangular membrane. 

We have the boundary conditions which mean that the membrane cannot vibrate at 
the boundary: u( 0, y, t) = u(a, y, t) = u(x, 0, t) = u(x, b, t) = 0. To solve the equa¬ 
tion, we again use the product ansatz 

u(x, y, t) = V (x, y) • Z(t ), 

by means of which we first of all separate the space variables from the time variables. 
Normal vibrations are of this type. All points x, y (mass points) then have the same 
time behavior. This is typical for eigenvibrations. By insertion into the wave equation, 
we obtain 

Z(t) _ ^ 2 AV (x, y) 

Z(t ) C V(x,y) ' 

Here, one has a function of only the position equal to a function that depends only on 
the time. Thus, this identity is only valid if both functions are constants, i.e., unchang¬ 
ing with respect to space and time. The constant that equals these functions is denoted 
by —co 2 , the quotient co 2 /c 2 by k 2 . 

One then has 



(10.3) 


7 


AV(x, y) 
V(x,y) 



(10.4) 


We can at once write down the general solution of (10.3): 

Z(t) = Asin(cot + S). 

If we had selected a positive separation constant, i.e., +oo 2 in (10.3), the solution 
would have been Z(t) = t ±(0t . This means that the solution would either explode 
with the time (e +wr ) or fade away (e“"0- The negative separation constant in (10.3) 
obviously guarantees harmonic solutions. 

In order to separate the two space variables, we use a further separation ansatz: 



0 ax 

Fig. 10.2. A rectangular mem¬ 
brane 


V(x,y) = X(x)-Y(y). 


Insertion into (10.4) yields 



OI o 

+ X— T + IcXY — 0. 
dy z 
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From this, it follows after division by X(x)Y (y) that 


i a 2 x(x) l a 2 r(y) 2 

X(x) dx 2 + Y(y) dy 2 + 



Here again, a function of x equals a function of y only if both are constants. 
We split the constant k 2 into 



and thus obtain 


l d 2 x 

19 ? 



Therefore, one has 


+ ktX = 0, 


d 2 x 

dx 2 

d 2 Y 2 
— +k y Y- 0, 


l d 2 Y 
Y ~dy 2 



solution: X(x) = A\ sin (k x x + 8 \), 
solution: Y (y) = A 2 sin^y + 82). 1 


By multiplying the partial solutions and combining the constants, one obtains the com¬ 
plete solution of the two-dimensional wave equation: 


u{x , y,t) = B sinfex + Zq) sin^y + <$ 2 ) sin(cot + 5). 


10.3 Inclusion of the Boundary Conditions 

With the given boundary conditions for u , we obtain 

u( 0, y,t) = B sinZq sin^y + 82 ) sin (cot + 8 ) = 0, 
u(x,0, t) = B sin^x 8 \) sin <$2 sin(cot + 8 ) = 0. 

Both equations are only satisfied for all values of the variables x,y,t if 
sin<$i = sin <$2 = 0, 

which is for example correct for 8 \ =82 = 0 . 

From this, we obtain the other boundary conditions: 

u(a,y,t) = B sin (k x a) sin^y) sm(cot + 8 ) = 0, 
u(x, b, t) = B sin^x) sin^Z?) sin(cot + 8 ) = 0. 

From considerations similar to those above, we find 
sin (k x a) = sin (k y b) = 0, 


1 One of the two separation constants k\ or ky could in principle be chosen to be negative, so that 
e.g., — ky = k 2 . In this case we would get Y = Ae k y' y + Be~ k y ' y , and the boundary conditions 

u(x,0,t) = u(x,b,t) = 0 could be satisfied only by A = B = 0. 
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from which we get 


k x a = n x Tt, k y b = n y Tt, with n x ,n y = 1,2 ,.... 

The values n x =n y = 0 must be excluded, since they lead to u(x, y, t) = 0—as for 
the vibrating string. 

Now we have 

* 2 =*?+*?=»5(j) +»?(£) ■ 

and because go = k - c, we find for the eigenfrequency 


M n x n y — ^ 


I _i_ 

a 2 b 2 ' 


10.4 Eigenfrequencies 


Thus, the eigenfrequencies of the rectangular membrane are 


^n x n y — Crt} 


a 2 ^ b 2 ’ 


where the lowest frequency is the fundamental harmonic : 


wii - C 7 r /J+J- 

For the string, we have co n = nco\ , i.e., the higher harmonics are integer multiples 
of the fundamental frequency. This is no longer valid in the two-dimensional case. 
Contrary to the harmonic frequency spectrum ( co n = nco\) of the string, the membrane 
has an anharmonic spectrum ( co nxriy neon). 


10.5 Degeneracy 

If in the special case of a square membrane, the edges have equal length, a — b, then 
it follows that 

cits/2 

(0nxny = —^— Mn , ®H = —■ 

The table of the ratios Go nxny /co\\ for several values of the “quantum numbers” 
n x , n y of a square membrane shows (see Table 10.1) that for different pairs of “quan¬ 
tum numbers” there exist the same eigenvalues, i.e., there are different possible eigen- 
vibrations with the same frequency. Such states are called degenerate. For a square 
membrane which is symmetric with respect to the meaning of the x- and y-coordinate, 
all states n x n y arranged symmetrically with respect to the main diagonal of the table 
are degenerate. 
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Table 10 . 1 . The ratio con x n y /co\ \ as a function of n x and n y 


n y \n x 

1 

2 

3 

4 

1 

1.00 

1.58 

2.24 

2.92 

2 

1.58 

2.00 

2.55 

3.16 

3 

2.24 

2.55 

3.00 

3.54 

4 

2.92 

3.16 

3.54 

4.00 


The degeneracy is removed at once if a ^ b. Generally degeneracies appear only 
in systems with definite symmetries. 

We further recognize that the square membrane contains a fraction of harmonic 
overtones (diagonal elements of the table). 


10.6 Nodal Lines 


At the points where the position-dependent part of the wave motion vanishes, the 
string has a node, and the membrane correspondingly has a nodal line. 

The position-dependent part reads 

n x nx n y ity 

sin-sm ——. 

a h 

Then for n x =2 and n y = 1 we have 


. 2 nx . ny 

sm-sm — 

a b 


0 


as the condition for a nodal line. 

Away from the edges this condition is still satisfied for the straight line v = a/ 2, 
which represents a nodal line for (n x , n y ) = (2, 1). In general all straight lines of the 
form 

ma nb 

x = —; y = — (m — 1 , 2 ,..., m < n x \ n = 1 , 2 ,..., n < n y ) 
n x n y 


are nodal lines. 


10.7 General Solution (Inclusion of the Initial Conditions) 

The general solution of the wave equation for the rectangular membrane, since it is a 
linear differential equation, is obtained as a sum of the particular solutions (superpo¬ 
sition principle): 

OO 00 


u(x,y,t)= C n x n y 

n x =l n y =l 

We now can evaluate the c nxU 


. n x 7 tx . 

sm-sm 

a 


and the (p„ x „ 


n y 7 ty 

— sin (C0n x n y t + <Pn x n y )• 
y from the initial conditions 


u(x , y, t = 0) = uq(x, y), 
u(x,y,t = 0) = vo (x,y). 
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n x = l 
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4 
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Fig. 10.3. Nodal lines of sev¬ 
eral eigenvibrations 
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For t = 0, the general solution and its time derivative read as follows: 


^ . . n x nx . n y ny 

uo(x,y)= 2^ c n x n y Sin (Pn x n y ' sm- sin ' 

n x ,n y = l 


b ’ 


^ . n x nx . YiyTiy 

l ) 0 (x,y)= 2_^ Mn x n y Cn x n y COS (p nx n y • Sin-SU1 - 

i ^ 

n x ,n y =1 


b 


We redefine the constants: 


An x n y — c n x n y sin (Pn x n y , 

B n x n y — < ^ > n x n y c n x n y COS (Pn x n y - 


The above equations then change to 


00 


U 0 (X, y) = ^ A n x n 


. n x 7tx . n y Tty 
sin-sm —-, 


n x — 1 

oo 


vo(x,y)= ^ B n x n y sin sin 

Yl x ,fl y —— 1 


. n x 7tx . UyTiy 
a b 


(10.5) 

( 10 . 6 ) 


(10.7) 

( 10 . 8 ) 


The coefficients A„ x „ and B„ xtly can be determined by means of the orthogonality 
relations. These read 
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a 

/ si 


n x 7tx . n x nx 

sin-sin- ax = adn rnr , 

a a 


(10.9) 


O 

-b 


HyTiy . YlyTCy 

sin —— sin —— dy = b8 h n 
b b 


We assume (10.7) to be continued across the borders as an odd function, multiply 
(10.7) by sin(h x 7tx/a), and integrate over v from —a to a. Next we multiply by 
sin(h y Tty/b) and integrate over y from — b to b: 


a b 


[ [ , x • . n y Jty 

/ / uo(x,y)sm -sin—- dxdy 

J J a b 


-a -b 


a b 


-<// 


. n x Ttx . n y 7ty 

uo(x, y) sin-sin —- dxdy 

a b 


o o 


n 

= J2 A n x n y / Sin - 

vi l/i J 


n x ,n y 

oo 


■Ttx n x Ttx / riyTty Yiyity 

-sin- dx / sin —-sin —- dy 

a a J b b 

-b 


— ^ ' An x ny&n x n x a &n y nyb — dbA Uxny . 
n x ,n y 

Likewise, we treat (10.8) to evaluate the coefficients B Uxn One then obtains 


a b 


An * n y - ab 


B 


n x n y 


// 

0 0 
a b 

-=// 


. n x 7tx . n y 7ty 

u o(x, y) sm-sin —— dxdy , 

a b 


( 10 . 10 ) 


n x 7tx n y 7ty 

vo(x, y) sin-sin —-— dxdy. 

a b 


o o 


With the knowledge of the A nxUy and B nxYly , one now can calculate the c nxYly and (p Hx 
from (10.5) and (10.6). 


10.8 Superposition of Node Line Figures 


In the case of degenerate vibrations of the membrane, there can also appear node lines 
that arise by superposition of the node line figures of the degenerate normal vibrations. 

As an example we consider the position dependence of the degenerate normal vi¬ 
brations of the quadratic membrane 


jtx 2 jry 2 ttx Tty 

u 12 = sm — sm-sino;i 2 ^ and U 21 = sm-sm — sin&> 2 iL (10.11) 

a a a a 
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For the superposition of the two normal vibrations, we write 
u = U \2 + Cu2 \. 

The constant C specifies the particular kind of superposition. The equation of the 
nodal line is obtained from u — 0. The common numerical factor sincu^ = sm<j 02 \t 
obviously factors out. For the special case C = ±1, we find 

. Ttx . 2ny . 2ttx . Tty 

sin — sin-zb sin-sin — = 0 

a a a a 


or, rewritten, 

TTX Tty ( ny JtX \ 

sin — sm — I cos — d= cos — ) = 0. (10.12) 

a a \ a a ) 

By setting the bracket equal to zero, we get the equations for the two nodal lines: 

y = x for C = — 1 and y = a — x for C = + 1. 

Fig. 10.4 illustrates the nodal lines. 


y 

a 

a_ 

2 


y 

a 


U 2\ 





a X 


a/2 a X 



Fig. 10.4. Nodal lines for de¬ 
generate eigenvibrations 


We recognize that new vibrations with new kinds of nodal lines can be constructed 
by superposing appropriate normal vibrations. One can excite such specific superpo¬ 
sitions of normal vibrations by stretching wires along the nodal lines (right figure) so 
that the membrane remains at rest along these lines. 


10.9 The Circular Membrane 

In the case of the circular membrane, it is convenient to change from the Cartesian 
coordinates to polar coordinates, i.e., from u = f (x, y, t) to u = \j/(r, (p,t). 

For this recalculation, we have 


x = r cos (p, 

, y 

tan ip = -, 
v 


y = r sm<p, 
r = y[x i + 




For the transformation of the Laplace operator, we need the derivatives 

dr y 
dy 

By differentiating the tangent, we get 


dr 

dx 


x 

r 


■- coscp, 


■■ sm (p. 


dtarup dtmupdcp 


1 d(p 


y 


dx 


dip dx COS z ip dx 


(10.13) 


(10.14) 


(10.15) 


y ‘ 



/y 

r A \ 



/<p\i \ 



X J 

X 


Fig. 10.5. Circular membrane 
(drum) 
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By inserting the polar representations for x and y, one gets dip/dx = —(sin ip)/r. The 
corresponding differentiation of tamp with respect to y yields dip/dy = (cos ip)/r. To 
get the two-dimensional vibration equation in polar coordinates, we first transform the 
Laplace operator A(x, y) to polar coordinates A(r, (p). The differential quotients are 
interpreted as operators. 

We demonstrate the calculation for the x-component; the recalculation of the 
y-component then runs likewise. According to the chain rule, we have 


3 3 dr 3 dip 

dx dr dx + dip dx 


(10.16) 


After insertion of the above results, we obtain 


— = COS ip — 

3x 3r 


3 sin ip 3 


dip 


(10.17) 


We square this result, taking into account that the terms act on each other as operators. 
(The square of an operator means double application.) 


d 1 


1 3 


= I cos ip — sm ip — — 

3x z V dr r dip 


3 1 3 

cos ip -sin <p - 

dr r dip 


(10.18) 


By multiplying out, one first gets the four terms 

"sin ip 3 sin<p 3 


a 2 ( a a 

= I cos<p— • cos<p 


3x 2 


3r 3r 

3 sin ip 3 


+ 


COS ip 


dr 


dip 


r dip r dip 

sin ip 3 3 

-• cos ip — 

r dip dip 


(10.19) 


We now treat the individual terms according to the product rule: 


COS ip 

sin<p 

r 

cos ip 
sirup 


— • cosc?— 
dr dr 

3 sin ip 3 
dip 


r dip 
3 sin ip 3 


dr 


r dip 

3 3 

— • cosc?— 
dip dr 


= cos 2 ip 


dr 2 ’ 


sin ip cos ip 3 sirr ip 3 


dip 


dip 1 


cos ip sin ip 3 ^ cos ip sin ip 3 3 


dip 


dr dip ’ 


sirr ip 3 ^ sin ip cos ip 3 3 


dr 


dip dr 


From this, one obtains 


3 2 2 sin 2 ip 

S2=“ S *>^ + ^ 


3 d 1 

dr dip 2 


2 sin ip cos ip f 3 3 3 

dip dip dr 


Analogously, one gets for the y-component 

3 2 . 2 9 2 cos 2 <p f 3 3 2 \ 2 sin ip cos ip ( 3 3 3 \ 

3y 2 ^ dr 2 r 2 \ 3r dip 2 ) r 2 \3<^ dip dr) 
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By adding both expressions, we obtain the Laplace operator in polar coordinates: 


9 2 9 2 


9 2 19 1 9 2 


9x 2 dy 2 dr 2 r dr + r 2 9^> 2 

The vibration equation then takes the following form: 

d 2 u(r,cp,t) 13 u(r,<p,t) 1 3 2 u(r,cp,t) 1 d 2 u(r,cp,t) 


dr 2 


r 3r 


+ 


r 2 3^? 2 c 2 3^ 2 


( 10 . 20 ) 


( 10 . 21 ) 


The equation of motion is solved again by separation of the variables. We use a product 
ansatz for separating the position and time functions: 


u(r, (p, t) = V (r, (p) • Z(t). 

By insertion into the wave equation, we obtain 


z(0 


g 2 v i dv i 9 2 y 

dr 2 r dr r 2 dcp 2 


_ 1 9 2 Z 


We divide both sides by V(r, <p) - Z(t)\ 

d 2 v i lay i l d 2 v 

dr 2 - r dr r 2 d<p 2 1 Z(t ) 

V(r, <p) “ c 2 Z(0’ 

As a separation constant, we choose 

c 2 T ~ 

and introduce the angular frequency by 
= ck. 

From this, we get 
Z+® 2 Z=0 
with the solution 

Z(t) = C sin (cot + 3). 

By insertion of the constant — k 2 , the equation of motion takes the form 


d 2 V 1 dV 1 d 2 V l2 

9 ^ + 7 97 + ^ 9 ^ + * y “°- 


We separate the radial and angular functions by a second product ansatz : 
V(r, <p) = tf(r) -0(<p). 


( 10 . 22 ) 

(10.23) 

(10.24) 

(10.25) 

(10.26) 

(10.27) 

(10.28) 

(10.29) 

(10.30) 


Hence, we obtain 

d 2 R , 1 dR 1 d2( t> 

dr 2 ^ r dr r 2 dip 2 ^2 _ q 

*(r) <K<p) 


(10.31) 
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We separate the variables by multiplying by r 2 : 


r 2 (d 2 R/dr 2 ) + r{dR/dr ) 

W) 


+ k 2 r 2 + 


d 2 <p/dcp 2 

4>(<p) 


= o. 


(10.32) 


Here again, the equation is valid only then if both functions are constants. Hence, we 
choose 


1 d 2 <fi 

0 dip 2 


(10.33) 


from which one obtains as a solution for <p(cp) 

= Ae^ v + Be-'vW' 

= C sin(m<p + 8) with m = m = 0, 1,2, 3,_ (10.34) 


m must take only integer values to get the periodicity of the solution. At the angle 
2 tv + (p, the solution must be identical with that for the angle (p. This fact is often 
described by the phrase periodic boundary conditions. 

Now we can admit—without restricting the problem—only positive m, since with 
negative m only the sense of rotation angle is inverted. 

Thus, the equation of motion for the radial function R looks as follows: 


> d 2 R 
dr 2 


dR 


r,2 2 i 


or 


d z R 1 dR 


dr 2 r dr 


m 


+ -— + U )R = 0 


(10.35) 


We substitute z = kr, dr =dz/k. Then we get 


9 d 2 R k 2 dR ( 9 m 2 k 2 
k + ~Tz + \ k 
d 2 R 


1 dR ( m 
dz 2 + ~z^~z + V~V ]R 


R =0, 

0. 


(10.36) 


In this form, the equation is called Bessel’s differential equation. This differential 
equation and its solutions appear in many problems of mathematical physics. 


10.10 Solution of Bessel's Differential Equation 2 


The solution of our differential equation 


^ 2 gfe) 

dz 2 


+ - 

z 


1 dg(z) A m 


dz 


+ [l-—\g( z ) = 0 


(10.37) 


2 Friedrich Wilhelm Bessel, b. July 22, 1784, Minden-d. March 17, 1846, Konigsberg (Kaliningrad). 
Bessel was first a trade apprentice in Bremen, then until 1809 an assistant at the observatory in 
Lilienthal, and then professor of astronomy in Konigsberg and director of the observatory there. In 
1838 he succeeded in measuring the annual parallax of the star 61 Cygni, thus becoming the first to 
determine the distance to a fixed star. As a mathematician Bessel was best known for his investigations 
on differential equations and on Bessel functions. 
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cannot be found by integration. Approaches using elementary functions also fail. We 
therefore try with the most general power series expansion: 


g(z) = z lx 



(10.38) 


The separation of a power factor is not necessary, but will prove to be very convenient. 

Since in the center of our membrane the vibration remains always finite, g(z) must 
not have a singularity at z = 0. But since for z -> 0 we have 

g(z)^a 0 z^, (10.39) 

for these physical reasons we must have /x > 0. To get a more general statement, we 
consider the asymptotic behavior of Bessel’s differential equation for z -> 0 for at first 
arbitrary /x. 

We then can set as above 

g(z)^a 0 z^ (10.40) 

and obtain by inserting: 

li(ji - l)z M-2 + + z' x - m 2 z fl ~ 2 = (/r(M - 1) + M + z 2 - m 2 )z^~ 2 

^(lu 2 -m 2 )z IJ '~ 2 = 0, (10.41) 

since for z —»■ 0 we also have z 2 —»■ 0. We thus have the condition 


II 2 — m 2 = 0. (10.42) 

For the above-mentioned reasons, which are of a purely physical nature, it follows that 

/x = in , me No. (10.43) 

The constant m is itself an integer. To see this, we remind ourselves of the angular 
dependence of the total solution, namely, 

f{(p) = sin (rmp + 8). (10.44) 

Since after a full revolution we return again to the same point of the membrane, the 
solution function must have the period 2 tt . But this holds only then if m is an integer! 
We now try to determine the coefficients of our ansatz 


g m (z) = z m (ao + a\z + ci 2 Z 2 ...), m = 0,1,2, — 


(10.45) 
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For this purpose, we insert the ansatz in the Bessel equation. The individual terms of 
this equation then have the following form: 

j 2 

_____ _ z m ~ 2 (a 0 m(m — 1 ) + a\(m + \)mz + d 2 (m + 2 )(m + l)z 2 
+ < 23 (m + 3)(m + 2 )z 3 H-), 

= z m ~ 2 (aom +ai(m + l)z + ft 2 (m + 2)z 2 + a^m + 3)z 3 H-), 

z dz 7 

g(z) = z m ~ 2 (a 0 z 2 + ftiz 3 H-), 

2 

/-\ /-\ /-% o o o q 

- 3 -g(z) = z m ~ (—aom — a\m z — ci 2 m z — a^m z -). 

The sum of the coefficients for each power of z must vanish, i.e., ao(m(m — 1) + 
m — m 2 ) = 0. Since the bracket vanishes, ao can be arbitrary. 

For ai, we get 

( 2 i (m(m + 1 ) + (m + 1 ) — m 2 ) = 0 , 

( 2 i( 2 m + l)=0, i.e., a\ = 0. (10.46) 

From the coefficient of z m , it follows that 

(22 ((m + 2 )(m + 1 ) + (m + 2 ) — m 2 ) + ao = 0 


or 


(22 (4m +4) = -ao. 


(10.47) 


Furthermore, we get 


(23 ((m + 3)(m + 2) + (m + 3) — m 2 ) + a\ = 0, 

(23 ( 6 m + 9) = —( 2 i, i.e., ( 23 = 0 . 


(10.48) 


Generally, we find the condition equation 

( 2^+2 ((m + p + 2)(m + p + 1) + (m + p + 2) - m 2 ) + ( 2 ^ = 0, 

0^+2 ((m + + 2) 2 - m 2 ) = -(2^, 

77+2 (m + /? + 2) 2 — m 2 (/? + 2) (2m + p + 2) 


(10.49) 


This recursion formula allows one to determine the coefficient a p +2 from the preced¬ 
ing cip . Because a\ = 0, it follows that all d2 n -i vanish, i.e., in the series expansion 
of the solution function there appear only even exponents. For these one obtains with 
( 2 o / 0 : 


<32 n — 


—^2n—2 

2ft (2m + 2ft) 


-<32/z-2 
2 ft 2 (m + ft) 


(10.50) 
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In the next step, we replace a2 n -2 by ci2n-4 and obtain 

_ +^2n—4 

Uln ~ 2M2n - 2)(2m + 2n)(2m + 2n - 2) 

_ _ <22n-4 _ 

2 2 ft(ft — l)2 2 (m + n)(m + n — 1) 

By continuing this way, we can relate a2 n back to ao. We obtain 

(-l) n ao 

Cl2n — 

2 n n(n — 1) • • • 1 • 2 n (m + n)(m + n — 1) • • • (m + 1) 

(-D^o 

2 2n n\(m + n)\/m\ 

Thereby, we obtain the following solution functions: 


00 

gm(z) =aoz m w!y] 

rc=0 


(-ir 

n\(m -\-n) \ 2 2n 


(10.51) 


(10.52) 


(10.53) 


With the special choice ao • ml = 2 m , we obtain the Bessel functions: 


Jm(z) — 


m oo 

E 


n =0 


(-if 

n\(m + n)\ 


In 


oo 


= E 


(-if 

n\(m + n)\ 


2 n+m 


(10.54) 


The graph of the first Bessel functions is given in Fig. 10.6. We see that for large 
arguments the Bessel functions vary like the trigonometric functions sine or cosine. 
Now we can immediately write down the solutions of our differential equation: 


V m (r, (p) = c m J m (kr) sin(m(p + S m ). 


(10.55) 



Fig. 10.6. Graphical represen¬ 
tation of the lowest Bessel 
functions 
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The membrane cannot vibrate at the border r = a, i.e., the boundary condition reads 
V(a,(p) = 0 for all ip. 

From this, we obtain the condition 


Jffi (Jc ’ a) — 0, 


from which the eigenfrequencies can be determined. For this purpose we must find the 
zeros of the Bessel function: 


z} z^ 

/0fe) = 1 - 4 + 65 - 
Tf , z z\z 5 
Jl(z) -2~16 + 3U 


+ ••• = 0 , 

- + ••• = 0, 


etc. 


(10.56) 


These zeros—except for the trivial ones for z = 0—can in general not be determined 
exactly; they must be calculated by numerical methods. If we denote the nth node of 

the function J m (z) by z ^, we obtain the following table for the values of the first 

S™). 

Zn 


Table 10.2. Zeros of the Bessel functions. 


n 



m 



0 

1 

2 

3 

4 

5 

1 

2.41 

3.83 

5.14 

6.38 

7.59 

8.77 

2 

5.52 

7.02 

8.42 

9.76 

11.06 

12.34 

3 

8.65 

10.17 

11.62 

13.02 

14.37 

15.70 

4 

11.79 

13.32 

14.80 

16.22 

17.62 

18.98 

5 

14.93 

16.47 

17.96 

19.41 

20.83 

22.22 

6 

18.07 

19.62 

21.12 

22.51 

24.02 

25.43 

7 

21.21 

22.76 

24.27 

25.75 

27.20 

28.63 

8 

24.35 

25.90 

27.42 

28.91 

30.37 

31.81 

9 

27.49 

29.05 

30.57 

32.07 

33.51 

34.99 


Useful approximate solutions may also be obtained by considering the asymptotes 
of the Bessel functions for z -> oo. Then 


Jm(z) 



mn 

~Y 



(10.57) 


We give this without proof. A look to the graphical variation of the Bessel functions 
shows the close analogy with the cosine function for large arguments. 

From this one can determine zeros: 


cos I zi m) - 


z(m) _ 

-Cm 


mn 


-?'=° 


mn 


n n 

- — = nn - —, 
4 2 


-(m) 


mn n n 

= nn H-= (An + 2m — 1) —. 

2 4 v J 4 


(10.58) 


A comparison of these values with the exact ones from Table 10.2 shows that particu¬ 
larly for n large compared to m one obtains good approximate values: 
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Table 10.3. Comparison of the exact zeros of the Bessel functions with those obtained from the 
asymptotic approximation 



m = 0 


m = 5 


zf 

zf 

zf 

zf 

n = 1 

2.41 

2.36 

8.77 

10.21 

n = 2 

5.52 

5.49 

12.34 

13.35 

n — 9 

27.49 

27.49 

34.99 

35.34 


With the exact solutions the boundary condition is 

Um) m _ (m) Urn) _ }_ _ (m) 

K n a ’ K n 

For the eigenfrequencies, we get 

oU =k^-c= C ~- 4 m) = coo • 4 m) • (10.59) 

Thus, Table 10.2 also shows the values for the ratio co^ /coo. By drawing all these 
eigenfrequencies along an axis, one arrives at Fig. 10.7. The distances between the 
individual eigenfrequencies are fully chaotic. Thus, we are dealing with extremely 
anharmonic overtones. This is the reason why drums are badly suited as melodic in¬ 
struments ! 


co r 


(0) 


CO, 


( 1 ) 


CO i 


CO 


(2) (0) 


CO n 


(3) 


CO, 


(1) (4) 


CO. 


CO 




H-h 


The general solution of the vibration equation is the superposition of the normal 
vibrations. It now reads 

u(r, cp, t ) = y ^4 m) / m (^ m) r) • sin {mcp + 8 m ) • sin(^ m) ^ + (10.60) 

m,n 

In analogy to the Fourier analysis, the c^ can be found so that u(r, cp, t) can be 
adjusted to any given initial condition u(r,cp, 0) or u(r,cp, 0). 

Finally, we want to get a survey of the nodal lines of the vibrating membrane. On 
these lines we must have 

Wm,n(r, cp) = c^Jmik^r) ■ sin (mcp + S m ) = 0. (10.61) 

Then we get nodal lines if either 

J m (kh m) r) = 0; (10.62) 


this is realized for 


r 


(m) 



Fig. 10.7. Linear representa¬ 
tion of the eigenfrequencies of 
the circular membrane 


/ = 1,2, — 1; 


(10.63) 
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or if 


sin (nup + 8 m ) = 0, 
i.e., for angles 


For the first nodal lines, we get Fig. 10.8 (with 8 m = 0). 



m-2 




(10.64) 


(10.65) 


EXAMPLE 


10.1 The Longitudinal Chain: Poincare Recurrence Time 

The equations of motion for a system with n vibrating mass points which are con¬ 
nected by n + 1 springs of equal spring constant k read 

[ + k(X 2 —x\) 

- k(x 2 - X\)+k(X3 - X2) 

— k(x 3 — X2)+ k(X 4 — X 3 ) 

: ( 10 . 66 ) 

~k(x n -1 - x n - 2 ) + k(x n -x n -i) 

— k(x n — x n -\) — kx n . 


mx 1 

= — 

mx 2 

= 

mx*3 

— 

miV 

_ l = 


— 

With 



f Xi\ 


X 2 

r = 



\ x n / 


Fig. 10.9. 
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this can be written succinctly as 
mr = Ck r, 
where 

/—2 1 \ 


c = 

1 -2 1 

1 - 

2 1 

and r = 

^*0 

x 2 



1 

-2 1 

1 -2) 


\%n J 

With the ansatz 




r = acos&tf, a = 

/fli\ 

<32 

5 




\ a n ) 




we look for the normal modes: 


(kC + mr E n ) a cos cot = 0, 


= D n(C0) 


D n (co) 


(mco 2 — 2k k \ 

k mco 2 — 2k k 


v 


t 


Here, 


E n 


/1 

0 . 

• °\ 

0 

1 . 

. 0 

\o 

0 . 

.. 1 / 


(10.67) 


( 10 . 68 ) 


(10.69) 


represents the unit matrix. 

For nontrivial solutions for a, the determinant D n (< co ) of the matrix D /? (co) van¬ 
ishes. Furthermore, we use for the coefficients a an ansatz with phase 8 and y to be 
determined. 


Example 10.1 


a := (aj = sin (jy -8), j = 1,2 


(10.70) 


The evaluation of the line j yields 


kaj -1 + (m&> 2 — 2k)<z/ + k<2j + i = 0, 
k sin((j — l)y — 8) + (mm 2 — 2k) sin(jy — 5) + k sin ((j + 1)}/ — 5) = 0, 

=>- k cos y + (m&> 2 — 2k) + k cos y = 0, 




9 k 

co = 2 —(1 — cosy), 
m 



co = 2- 


(10.71) 
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We know that the characteristic polynomial has n zeros: 


(Oi =2,/-sin—, i = 1,2,... ,n. (10.72) 

V m 2 

The boundary conditions ao = a n +\ = 0 must be satisfied. The first one requires that 


sin 8 = 0, hence 8 = lit, l e Z, and therefore w.l.o.g. / = 0. 

From the second boundary condition, it further follows that 

in 


sin((n + l)n) =0 


Vi 


n + 1 

We summarize the result for the i th eigenmodes: 
in 


, for all i e {1,..., n} 


r, (0 = sin j 


n + 1 


- cos a>it, j = 1, 2,..., n 


with 


k in 
cot — 2 J — sin ; 


m 2n-\-2 


(10.73) 


(10.74) 


(10.75) 


(10.76) 


The general solution of (10.66) is a superposition of the various eigenmodes, i.e., 
a vector r (t) with the components xj(t): 


Xj( 


i=\ 


;(t) = y^(c z - -coscoit + bi -sincoit) -sin( j • —), for j = l,2,...,n. (10.77) 
^ n + 1 ' 


The coefficients sin(jin/(n + 1)) are, according to (10.70), (10.73) and (10.76), the 
components of the eigenvector to the i th mode, and since D n (co) is symmetric, the 
latter ones represent an orthogonal basis in W 1 . 


7 = 1 




In 

n + 1 


n + 1 


Sn • 


(10.78) 


We explicitly check this relation in the following Exercise 10.2. We define the ortho¬ 
normal eigenmodes a z : 


“ = = .. 

or in detail 


nni 1 

sm-\. 

n + 1 J 

The general solution can then be written as follows: 

n 

r(t) = cos cot t + Z?; sintUiO^i- 

/=! 


a,- = 


n + 1 


7T i 2 ni 

sm-, sm-. 

n+1 n+1 


(10.79) 


(10.80) 
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The following interesting question arises: Let the system of n mass points (degrees 
of freedom) at the time to be at r(*o) = ro with the velocity r(to) = ro. The system 
moves away from this configuration, but after a certain time r it can closely approach 
the initial configuration and possibly return exactly into the initial configuration. We 
call this time r the Poincare recurrence time? One looks for the difference between 
the actual time-dependent state vector in the phase space (rand the start 
vector (r 0 ,r|f=o): 


e(t) =■■ yj ||r(0 - roll 2 + ||r(0 - r| f=0 ||^. (10.81) 

The index Q at the second scalar product for the velocities indicates a diagonal weight 
matrix Q which is suitably included into this normalization: 


/ 1 


CD, 


Q = 


CDX 




(10.82) 


(Di are the eigenfrequencies. In this way the factors coi obtained by differentiation of r 
in (10.80) cancel out. So it is guaranteed that both terms under the root in (10.81) have 
the same dimension. We formulate with (10.80) the following initial value problem: 


n 

r(0) = r 0 = 

/=l 

n 

r(0) = fo = 0 = ^ bi&iCDi =$ bj = 0 for all i. 
i =l 


For this choice, the distance e(t) in phase space, given by (10.81), is 


s(t)= / r Q • r Q — 2r 0 • y coscojtSLi + 


cf cos 2 CDi t + 


cf sin 2 cot t 


(10.83) 


Because ro = ci a;, this turns into 


s(t) = 


N 


E 

i =1 


c 2 ( 1 — 2 cos CDi t + cos 2 CDi t + sin 2 CDi t) 


=1 


= 2 


E 

N '' =1 


2 • 2 
cf sin 


CDit 

~Y 


(10.84) 


It is easily seen that this expression after t = 0 vanishes again only if the eigenfrequen¬ 
cies CDi are related by rational fractions. In the general case s(t) is only conditionally 


Jules-Henri Poincare. See also footnote on page 461. 
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Example 10.1 


periodic. This notion will be explained more precisely as follows: We first consider the 
purely periodic case. The period is then determined by the lowest of the frequencies 
that are related by rational fractions: 



in 

2n + 2 


We denote it by do = q • co\ (q e Z+), 


(10.85) 


[k n [k n 

(o = 2qJ — sin—— ^2qJ — ——. (10.86) 

y m 2n + 2 y m2n +2 

The last approximation holds for ft 1 (many mass points). To this frequency corre¬ 
sponds the time 


2n f~k ft + 1 

r =- =2 J -—. (10.87) 

qoo\ V m q 

This is the Poincare recurrence time , since after this time s(t) vanishes again, i.e., 
the initial configuration in the phase space is reached again after the time r. For very 
many mass points (ft —> oo), this time tends to infinity 


r -> oo, for ft —> oo. 


( 10 . 88 ) 


This is an important and physically plausible result: After preparation of an initial 
configuration, the system develops with time away from this configuration. At some 
time, just after the Poincare recurrence time, the state of motion of the system returns 
to the initial state (or in the general case very close to it). However, in the case of a 
great many degrees of freedom, the system “escapes” and the recurrence time becomes 
oo. For example, if one of the n masses coupled by strings—say, the first one—is being 
pushed (this corresponds to setting of ro and ro), the energy of this motion will spread 
more and more over the other masses. After the Poincare time r the first mass will 
have regained the entire energy. Only in the case of infinitely many coupled masses 
will this no longer happen, since r —> oo. This is of great importance for the statistical 
behavior of systems of particles. 

Addendum: Periodic systems with several degrees of freedom: Conditionally pe¬ 
riodic systems. Let us define a periodic system with several degrees of freedom as a 
system for which according to (10.83) the orthogonal coordinates used in the descrip¬ 
tion are periodic functions: 

r i = SLi cos coit. (10.89) 


The quantities r z - = 2n/coi are the periods belonging to the coordinates x z . In analogy 
to (10.77) and (10.80), we expand the general configuration vector r (t) in a Fourier 
series 


r (t) = (q cos coit + bi sino) z T)a z . (10.90) 

i 

The Fourier series (10.90) is in general no longer a periodic function with respect 
to the time t, although every individual term is periodic. The periodicity is assured 
only for such degrees of freedom whose frequencies co \, C 02 , • • • are related by rational 
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fractions. Therefore systems with several degrees of freedom are called conditionally 
periodic systems. 

The number of frequencies which are related by rational fractions determines the 
degree of degeneracy of the system. If there are no relations of this kind, the system 
is non-degenerate. If all frequencies are rationally related to each other, the system is 
called fully degenerate. In this case we observe a periodic time function. 

The Kepler problem treated earlier is an example for a system with two degrees 
of freedom (r, cp) which is degenerate and thus has only one frequency. By inventing 
a perturbation, say a quadrupole-like potential with the typical variation 1/r 3 , the 
degeneracy can be removed, which causes a rosette-like motion. 

As an example of a conditionally periodic motion, we note the anisotropic linear 
harmonic oscillator, which is a mass point with different spring constants in the vari¬ 
ous Cartesian directions. The trajectory of the mass point is a Lissajous figure which 
never turns into itself and in the course of time tightly covers the area given by the 
amplitudes. Only in the case of degeneracy there are periodicities in the motion. 

In the discussion of the Poincare recurrence time, we assumed a periodic motion. 
In the case of a conditionally periodic motion, the situation is—as was expected— 
completely analogous. In this case, after the Poincare recurrence time t the configu¬ 
ration vectors r (t), r (t) come very close to the initial configuration ro, ro- The initial 
configuration will not be reached again, but will be reached “nearly” again after the 
time z. For further discussion, we refer to the literature. 


EXERCISE 


10.2 Orthogonality of the Eigenmodes 


Problem. Prove the orthogonality relation for the eigenmodes 

Itc \ {n + V 


E si 


sin j 


i =1 


ITT 

12 + 1 


sm j 


n + 1 




which is explicitly used in (10.78) of the last example. 
Solution. Proof of orthogonality of the eigenvectors: 


7 = 1 


da = y>m( i ff [ j ) sin (/ 


jtj 

7=1 1 


(k — 1)tt 

cos-/ — cos 

12+1 


22 + 1 

(k + 1)tt 


22 + 1 


(10.91) 


Before continuing the exposition, we evaluate the sum of the following series: 

sin (x 12 / 2 ) cos x(n + l)/2 cos(vi 2 / 2 ) sinv(i 2 + l)/2 
COS kx = -;-—- = -;-—-1. (10.92) 


k =l 


sinx/2 


sinx/2 


This result is easily obtained by writing the cosine in terms of exponential functions 
and then evaluating the sum as a geometrical series. 
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Exercise 10.2 


The case k = l yields 


dkk — 


1 

2 




1 

2 




+ 1), (10.93) 


since sink7r = 0 for all k. 

In the case k ^ /, we calculate the sums in both alternatives given in (10.92): 


dk, = 2 


(k-l)xn j (k-l)n 
(«+l)2 bln 2 

sin fcOZL 

MI1 2(n+l) 




dkl '■ 


1 sm 


(k-l)nn 

0+1)2 


COS 


(k-l)TT 


sm 


(k—l)n 

20 + 1 ) 


• Q+/)7TW Q+QtT 

Sin 20+1) 2 


sin 


0+/)tt 

20 + 1 ) 


(10.94) 


(10.95) 


The vanishing of du(k / /) is immediately seen from (10.94) for even k — l and k + /, 
and from (10.95) for odd k — / and k + /. 


Supplement: Everything Happened Already —>1 Physical Theorem? 


Near the end of the nineteenth century, many physicists discussed the hypothesis 
that the course of the world repeats in eternal cycles. This interest was stimulated 
mainly by the works of Henri Poincare (1854-1912). Also a philosopher like Friedrich 
Nietzsche (1844-1900) was tempted by this theorem to a short guest performance in 
physics. The first speculations along these lines originated from almost nonscientific 
attempts to explain the phenomenon of heat. The Lord of Verulam (1561-1626), Fran¬ 
cis Bacon, had already identified heat as a form of motion but had failed to construct 
a quantitative theory. For lack of systematic investigations he had included the de¬ 
velopment of heat in dung hills in his considerations. The topic attracted more and 
more actors from physics, metaphysics, philosophy, politics and theology. We repro¬ 
duce here two quotations from Poincare’s works from this era. Henri Poincare in 1893 
wrote in “Review of metaphysics and morality”: 

Everybody knows the mechanistic world view that tempted so many good people, and the vari¬ 
ous forms in which it comes up. Some people imagine the material world as being composed of 
atoms which move along straight lines because of their inertia, and change their velocity only 
if two atoms collide. Other people assume that the atoms perform an attraction or repulsion 
on each other, which depends on their distance. The following considerations will meet both 
points of view. 

It would possibly be appropriate to dispute here the metaphysical difficulties that are related 
to these opinions, but I don’t have the necessary expert knowledge. Therefore I will deal here 
only with the difficulties the mechanists met when they tried to reconcile their system with the 
experimental facts, and with the efforts they made to overcome or to elude these difficulties. 

According to the mechanistic hypothesis all phenomena must be reversible; the stars for 
example could move along their orbits also in the opposite sense, without conflicting with 
Newton’s laws. Reversibility is a consequence of all mechanistic hypotheses. 

A theorem that can easily be proved tells us that a restricted world, which is governed only 
by the laws of mechanics, will pass again and again a state that is very close to its initial state. 
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On the other hand, according to the assumed experimental laws the universe tends towards a 
certain final state which it never will leave. In this final state which represents some kind of 
death, all bodies will be at rest at the same temperature. 

The doubts provoked this way, accompanying the developing theory of heat based 
upon an irreversible motion of atomic particles, have not yet been clearly removed. 

A classical illustrative example of Poincare’s “recurrence objection” is the parti¬ 
tioned box, one half filled with gas which uniformly distributes over the entire box 
after removal of the membrane. Experience tells us what happens, and an inversion of 
this “irreversible process” is never observed in practice. But Poincare did not think at 
all of an inversion, but rather of the chance that brought the particles into the ini¬ 
tially empty half. This chance—after some “appropriate time”—should also bring 
them back again to the initial half. 

In 1955, Enrico Fermi, 4 John Pasta, 5 and Stanislaw Ulam 6 considered a problem 
which corresponds to our Example 10.1, except for the additional inclusion of a non¬ 
linear coupling term. Their interest focused on finding, by means of the first com¬ 
puters, recurring processes such as we looked for in our purely linear problem. Sur¬ 
prisingly, they found an almost perfect recurrence of the initial conditions after large 
numbers of oscillations. The investigations and reflections of such properties of non¬ 
linear wave equations continue to this day and have been introduced into the theory of 
elementary particles (solitons). 


4 Enrico Fermi, Italian physicist, b. September 29, 1901, Rome-d. November 28, 1954, Chicago. 
Following studies in Pisa and research stays in Gottingen, Leiden and Rome, Fermi became a profes¬ 
sor of physics in Rome in 1925. He built up a research group which achieved leading experimental 
and theoretical results in nuclear physics. In 1938, Fermi was awarded the Nobel Prize in physics for 
his work on radioactive elements created by neutron bombardment and nuclear reactions triggered 
by slow neutrons. In the same year he left Italy, worked at Columbia University in New York and 
finally became professor in Chicago. During the war, Fermi was involved in the Manhattan Project to 
produce the nuclear bomb, and was the driving force in the development of the first nuclear reactor in 
1943. Fermi did seminal work in both experimental and theoretical physics, contributing to statistical 
mechanics, the general theory of relativity, and the theory of weak interactions. 

5 John Pasta, American physicist and computer scientist, b. 1918, New York-d. 1984, Chicago. 
Following different employments with police and the military, Pasta obtained a Ph.D. in theoretical 
physics and joined the Los Alamos National Laboratory, where he worked in the group of Nicholas 
Metropolis, constructing the MANIAC I computer. Later he became expert for computing with the 
Atomic Energy Commission, and in 1964 professor for computer science at the University of Illinois 
in Urbana-Champaign. 

6 Stanislaw Marcin Ulam, Polish mathematician, b. April 13, 1909, Lemberg (today Lviv, Ukraine)- 
d. May 13, 1984, in Santa Fe, New Mexico. Ulam was a student of mathematics with Stefan Banach 
and contributed to measure theory, topology, and ergodic theory. In 1938 he came to the US as a Har¬ 
vard Junior Fellow, and later joined the Manhattan project, on the intervention of John von Neumann. 
Together with von Neumann, he developed the Monte Carlo method to solve numerical problems 
using random numbers. Ulam suggested the functional principle of the first hydrogen bomb. 
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As we have seen in Chap 4, a rigid body has 6 degrees of freedom, 3 of translation 
and 3 of rotation. The most general motion of a rigid body can be separated into the 
translation of a body point and the rotation about an axis through this point (Chasles’ 
theorem). In the general case the rotation axis will change its orientation too. The 
meaning of the 6 degrees of freedom becomes clear once again: The 3 translational 
degrees of freedom give the coordinates of the particular body point, 2 of the rotational 
degrees of freedom determine the orientation of the rotation axis, and the third one 
fixes the rotation angle about this axis. 

If a point of the rigid body is kept fixed, then any displacement corresponds to a 
rotation of the body about an axis through this fixed point (Euler’s theorem). Hence, 
there exists an axis (through the fixed point) such that the result of several consecutive 
rotations can be replaced by a single rotation about this axis. 

For an extended body the vanishing of the sum of all acting forces is no longer 
sufficient as an equilibrium condition. 

Two oppositely oriented equal forces —F and F that act at two points of a body 
separated by the distance vector 1 are called a couple. A couple causes, independent 
of the reference point, the torque 


D = 1 x F. 


While the torque on a mass point is always related to a fixed point, the torque of a 
couple is completely free and can be shifted in space. 

The forces acting on a rigid body can always be replaced by a total force acting on 
an arbitrary point, and a couple. This can easily be shown by the following example: 
At the point P\, the force Fi acts. Nothing is changed if we let the forces —Fi and 
Fi act at O'. The force Fi acting on P\ and the force —Fi acting on O' represent a 
couple, and there remains the force Fi acting on O'. 

If there are several forces acting, we combine them into the resultant force F = 
JA F i. The torque is then given by D = • r • x F;. 

An extended body is in equilibrium if both the total force and the total torque 
vanish: 



+F 


Fig. 11.1. A couple causes a 
torque 


E F ‘=° 
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11 Rotation About a Fixed Axis 



Fig. 11.2. The forces acting 
on a rigid body are equivalent 
to a total force and a couple 



Fig. 11.3. Rotation about the 
fixed z-axis with the angular 
velocity co 


and 

Ef xF <=° 

i 

(equilibrium condition at the point O r ). 

For the calculation of the equilibrium condition, the origin of the vectors r, (refer¬ 
ence point of the moments) is arbitrary. Actually, for the point O it follows that (see 
Fig. 11.2) 

E F ‘=° 


and 

E r i x F; = E< a + f) x F; = a x E F,: + E f x F; = o, 

i i i i 

i.e., the condition that the sum of all forces and the sum of all torques must van¬ 
ish. 


11.1 Moment of Inertia (Elementary Consideration) 

A rigid body rotates about a rotation axis z fixed in space. By substituting the angular 
velocity Vi = co • ri for the velocity in the kinetic energy, one obtains 

T = Y -rriivf = -co 2 m,r 2 = -Geo 2 . 

^2 2 ^ 2 
i i 

Analogously, for the angular momentum in z-direction we have 

L z = E m i r i "i=®E m * r ' 2 = 0ft) ' 

i i 

Here, ri is the distance of the i th mass element from the z-axis. 

The sum appearing in both relations is called the moment of inertia with respect to 
the rotation axis. One has 

© = ymjr 2 . 

i 

To calculate the moments of inertia of extended continuous systems, we change from 
the sum to the integral, i.e., 

0= J r 2 dm = J r 2 QdV , 

body body 


with q representing the density. 
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For a spatially extended, not axially symmetric rigid body which rotates about the 
z-axis, there can also appear components of the angular momentum perpendicular to 
the z-axis: 

L = ^ ~2m v r v x v y = '^2,m v r v x (oj x r v ) 

V V 

= y^m y o)(Xy, y v , z v ) x (~y v ,x v , 0) 

V 

= co ^2(-x v Zv, -y v Zv,xl + yl)m v . 

V 

Since the body is supported in such a way that the rotation axis is constantly fixed, in 
the bearings appear torques (bearing moments) D = L. They can be compensated by 
“balancing,” i.e., by attaching additional masses so that the deviation moments 

-^x v z v m v and - y ^ j y v z v m v 

V V 

vanish. 


EXAMPLE 


11.1 Moment of Inertia of a Homogeneous Circular Cylinder 


We determine the moment of inertia of a homogeneous circular cylinder with density q 
about its symmetry axis. Adapted to the problem, we use cylindrical coordinates. The 
volume element then reads dV =r dr dip dz, and dm = q dV. The moment of inertia 
about the z-axis is then given by 


2tt 


0 = 


J r 2 dm = q J dcp J dz J 


r 3 dr; 


cylinder 


integration over the angle and the z-coordinate yields 


0 = 


R 



r 3 dr. 


Integration over the radius yields 

IX a 9 R 1 9 

0 = - hpR 4 = Q7xR 2 h— = -MR 2 . 
2 * 2 2 


Z 



h 

-It's 



Fig. 11.4. A homogeneous cylin¬ 
der rotates about its axis 
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Steiner's Theorem 1 


If the moment of inertia © 5 with respect to an axis through the center of gravity S of 
a rigid body is known, the moment of inertia © for an arbitrary parallel axis with the 
distance b from the center of gravity is given by the relation 

© = 0* + Mb 2 . 

If A B is the axis through the center of gravity and A'B' the parallel one with the unit 
vector e along the axis, this can be shown as follows: 


&AB = y]m v (r v x e) 2 , 

V 


®A'B> ='Y^m v (r' v xe) 2 . 

V 



The relation between r v and r' v is given by Fig. 11.5. Obviously r' v = — b + r v , and 
therefore, 

®A'B' = y] W v ((-b + r„) X e) 2 

V 

= y^m y [(-b x e) + (r„ x e)] 2 

V 

= y^m v (-b x e) 2 + 2y^m v (-b x e) • (r„ x e) + y^jn v (r v x e) 2 

V V V 

= Mb 2 + @ AB . 


The middle term vanishes because 
2(— b x e) • I ^m v r y I x e = 0, 

' V ' 

since S is the center of gravity and hence m v r v = 0. 

If for a planar mass distribution the moments of inertia ©**, ©^^ in the x,y -plane 
are known, for the moment of inertia © zz with respect to the z-axis we have 

= ®xx T - ®yy- 

If r v = ^/xf+y 2 is the distance of the mass element from the z-axis, we have 


®zz = Yl mvr v = E"v*v 2 + 

v v v 


1 Jacob Steiner , b. March 18, 1796, Utzenstorf-d. April 1, 1863, Bern. Steiner was son of a peasant 
and grew up without education. He received his first education from Pestalozzi in Yverdon. Subse¬ 
quently Steiner studied in Heidelberg, and then he served as a teacher of mathematics in Berlin; in 

1834, he became an associate professor at the university there. Steiner is considered the founder of 
synthetic geometry, which was systematically developed by him. He worked on geometric construc¬ 
tions and isoperimetric problems. A peculiar feature of his work is that he almost completely avoided 
analytic and algebraic methods in geometric investigations. 
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i.e., 

= ®yy "F • 

EXAMPLE 

11.2 Moment of Inertia of a Thin Rectangular Disk 

We consider the moment of inertia of a thin rectangular disk of density q. For the 
calculation of the moment of inertia about the v-axis, we take as the mass element 
dm = Qady. We then obtain 

b o 

f ry b 3 1 , 

&XX = y a Qdy=ag— = -Mb . 

0 

The moment about the y-axis follows likewise: 

1 ? 

Qyy = ~Ma . 

From © zz = ©** + @ yy , we then get 
® zz = l -M{a 2 +b 2 ). 

Fig. 11.6. A rectangular pla¬ 
nar mass distribution 


The moment of inertia about a perpendicular axis through the center of grav¬ 
ity is found, according to Steiner’s theorem, from the moment of inertia about the 
z-axis: 



®S =®zz-^ ( a 2 + b 2 ) = M(a 2 + b 2 ) Q - ^, 
®S = Y^( a 2 + b 2 ). 
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Mg 


Fig. 11.7. A body of mass M 
is suspended at the point P 


11.2 The Physical Pendulum 

An arbitrary rigid body with the center of gravity S is suspended revolving on an 
axis through the point P . The distance vector PS is r. Let ©o be the moment of inertia 
of the body about a horizontal axis through P, and let M be the total mass. If the body 
in the gravitation field is now displaced from its rest position, it performs pendulum 
motions. 

If the body is displaced, there is a torque 


D = ^r v xra y g = ^m y r y xg = Mrxg = 


—aMg sin^k, 


where k is a unit vector pointing out of the page in Fig. 11.7, and |r| = a. The angular 
velocity is then 


dip 

(o = +k—. 
dt 


From the relation D = L = ©o<u, we then obtain 
d 2 cp d 2 cp aMg 

—aMg sin = ©o or ——— sin<p = 0. 
dt z dt z ©o 

For small amplitudes, we replace sin ip by ip. With the abbreviation Q = 
^s/aMg/G o» we obtain the differential equation 




with the solution 


ip = A sin(£2 1 + 5). 


So one also obtains the period of the physical pendulum: 


T = 


2 JL = 2n J^L, 

£2 y Mag 


Since for the thread pendulum (mathematical pendulum) we have T = In^/TJg, 
it follows that both periods coincide if the thread pendulum has the length / = 
©o /Ma. 

If we replace the moment of inertia ©o by the moment of inertia © 5 about the 
center of gravity, then according to Steiner’s theorem we have 


T = T (a) = 2n 


l@ s + Ma 2 
Mag 


= In 



a 

H—. 
8 


From this, it follows that the period becomes a minimum if the vibration axis is a 
distance a = ^/® S /M from the center of gravity. From this relation one can experi¬ 
mentally determine the moment of inertia © 5 . 
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EXERCISE 

11.3 Moment of Inertia of a Sphere 

Problem. Find the moment of inertia of a sphere about an axis through its center. 
The radius of the sphere is a , and the homogeneous density is q. 

Solution. We use cylindrical coordinates (r, cp, z). The z-axis is the rotation axis. For 
the corresponding moment of inertia, we have 

® = Q J r 2 dV. 

sphere 

The center of the sphere is at z = 0. The equation for the spherical surface then reads 

2 , 2,2 2 2,2 2 

x + y + z = a or r + z =a . 


We write out the integration limits: 



Integration over z yields 

c 8 4 o 2 2 

® =-jr a J Q- a . 

Since the total mass of the sphere is given by M — (4/3)7ra 3 £, it follows that 

2 9 
©= - Ma 2 . 

EXERCISE 

11.4 Moment of Inertia of a Cube 

Problem. Calculate the moment of inertia of a homogeneous massive cube about one 
of its edges. 

Solution. Let q be the density and s the edge length of the cube. A mass element is 
then given by 


dm = QdV = Qdx dydz . 
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Fig. 11.8. Calculation of the 
moment of inertia of a cube 


Fig. 11.9. Rotation axes of the 
suspended cube 


The moment of inertia about A B (see Fig. 1 1.8) is evaluated as 


s s s 

®ab=qJ J J (x 2 + y 2 )dxdydz = 

0 0 0 




EXERCISE 


11.5 Vibrations of a Suspended Cube 

Problem. A cube of edge length s and mass M hangs vertically down from one of its 
edges. Find the period for small vibrations about the equilibrium position. How long 
is the equivalent thread pendulum? 


B 



Solution. The moment of inertia of the cube about A B is (see Exercise 1 1.4) 

&AB = \MS 2 . 

The center of gravity is in the center of the cube, i.e., for the distance a of the center 
of gravity S from the axis AB we have 


a = -sV2. 

2 

The equation of motion of the physical pendulum for small angle amplitudes was 
Mga 


<P + 


~(p = 0 


®AB 

with the angular frequency 



and the period 

T= ^ =2 ,yn =1 ,j^i 

co V Mga Y 3MgsV2 
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The length of the equivalent thread pendulum is calculated as 

T = r = 2nf-, 

which just defines the equivalence of the pendulums. By insertion of T one obtains 



or resolved, 

/ = -Vis. 

3 

This equivalent length of the thread pendulum is also called the reduced pendulum 
length. 


EXAMPLE 


11.6 Roll off of a Cylinder: Rolling Pendulum 

We consider a cylinder with a horizontal axis that can roll down an inclined plane. 
The system has one degree of freedom; hence an energy consideration is sufficient. 
The velocity of each point of the cylinder may be thought as being composed of the 
velocity Vi due to the translational motion and of the velocity V 2 v due to the rotation. 
The energy of motion is then given by 

E m v 2 1 2 i Mv 2 , /n n 

-y v v = 2 Vl Z^ m ’' + Z^2r v 2 l ' + v i- 2 ^ mv ' /2v - (1L1) 


For a symmetric mass distribution, the last term drops out, and we have 


M 


© 


r=-vf+-^ 


( 11 . 2 ) 


i.e., the energy of motion is additive in translational and rotation energy. For the cylin¬ 
der (with symmetric mass distribution) on the inclined plane we have 


M, © . 2 

—s + —cp — Mgs sma = E 


(11.3) 


(s measures the distance along the inclined plane). “Rolling off” without gliding 
means that the axis always moves just as much as corresponds to the rotation of the 
cylinder surface: 


Exercise 11.5 



Fig. 11 . 10 . Physical pendulum 
and reduced pendulum length 


s = Rep, 
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Example 11.6 


Fig. 11.11. Rolling 
on an inclined plane 


where R is the cylinder radius. We thus obtain the equation 


1 


0 


R 2 


- ( M -f- j — Mgs sin a = E, 


(11.4) 


.v = 


1 + 0/M/? 2 


g smckf. 


The acceleration of the cylinder rolling off is smaller than that of a gliding mass point. 
If the total mass of the cylinder is (approximately) concentrated on the axis, then 


0 

MR 2 


= 0, 


s = g sin a, 


and the acceleration is the same as for a gliding mass point. For a homogeneous cylin¬ 
der, we have 


0 _ 1 
MR 2 ~ 2’ 


s = -g sin a. 
3 


For a hollow cylinder with all mass on the surface, we have 


0 

MR 2 


s = -g sina; 
2 


the acceleration is only half of that for a gliding mass point. If we fix a circular disk 
concentric onto the cylinder, which extends beyond the base (like a wheel rim over the 
rail), then 0/M/? 2 > 1, i.e., the acceleration can be even lower. 

An investigation of the force balance lets us elucidate this problem once again from 
another point of view. At the point S , gravity acts and performs a torque with respect 
to the point A (see Fig. 11.11) 


Da = |Da| = R • Mg sin a, 


(11.5) 


while the constraints do not create a torque. The angular acceleration at the point A is 
therefore 


Da RMg sin a 2 g 

ip = co = -=- a =-sin a. 

r 0 A (3/2)M/? 2 3 R 


( 11 . 6 ) 


cylinder 














11.2 The Physical Pendulum 


171 


The moment of inertia ©a of a homogeneous cylinder is easily found by means of 
Steiner’s theorem. Since the moment of inertia with respect to the center of gravity is 
© 5 = MR 2 / 2, it follows immediately that 

3 

© A = & S +MR 2 = -MR 2 . 

If the cylinder rolls without gliding, for the linear acceleration of the center of 
gravity, we find 


\a s \ = \co x ta\ = coR = -g sin a. (11.7) 

The cylinder gets only 2/3 of the acceleration which it would get when gliding. Equa¬ 
tion (11.8) is found from simple considerations: Since the instantaneous velocity of 
the contact point A equals zero, one can consider A as instantaneously at rest. But this 
means that the rigid body instantaneously performs a rotation about the contact point 
A, with an angular velocity co. The velocity of an arbitrary point of the body is then 
given by (see Fig. 11.11) 


y = co x r a . 

Besides the gravitation force there acts the reaction force N (to balance the normal 
component of Mg) 

|N| = \Mg cos a |, (11.8) 

and the friction force Ff . The latter one is calculated from the balance 

Mg sin a + Ff = Ma s (11.9) 

and with (11.7), 

2 1 

— Ff — Mg sin a — -Mg s i na — - Mg sin cy. (11.10) 

Thus, the friction force acts opposite to the direction of motion. The condition for a 
rolling motion of the cylinder is 

\F f \<iiN, (11.11) 

where fi is the friction coefficient. 

Since 

1 

N = Mg cosa and \Ff\ = -Mg sina, 
we have 

1 

\F/\ < nMg cos a or -Mg sina < [i Mg cos a. (11.12) 

That means that a rolling motion exists only for tana < 3 /i. 


Example 11.6 
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A cylinder with asymmetrical mass distribution, which under the influence of grav¬ 
itation can vibrate by rolling on a horizontal base, is called a rolling pendulum. It 
represents a system with one degree of freedom; the position of the rolling pendu¬ 
lum can be specified by the rotation angle <p or by the coordinate v of the cylin¬ 
der axis (measured perpendicularly to the axis, see Fig. 11.12). “Rolling off” means 
Fig. 11 . 12 . Rolling pendulum that 



x — R(f. 


(11.13) 


Since there is only one degree of freedom, the energy law is sufficient for the de¬ 
scription. The motion is composed of a translational and a rotational motion. When 
applying (11.1), we have to account for the asymmetrical mass distribution. The ex¬ 
pression ^m y v 2 v, as a momentum due to the rotational motion, can be calculated 
by assuming that the total mass M is concentrated at the center of gravity, which is 
located off the axis by the distance s: \s<p\ is then the velocity |v2v I of this mass on 
rotation, and Tt — ip is the angle between vi and V 2 v . According to (11.1), we then 
have 


T = 


M .2 . ® *2 

—x H- (p 

2 2 


x • Mscpcoscp, 


where © is the moment of inertia about the cylinder axis. With the condition (11.13) 
for rolling follows 


T = - ( MR 2 + 0 — 2 MRs cos (p)cp 2 


(11.14) 


This expression can be interpreted also in a different way: If © 5 is the moment of 
inertia about an axis through the center of gravity which is parallel to the cylinder 
axis, then according to Steiner’s theorem we have 


@ = @ S +Ms 2 . 


Equation (11.14) thus turns into 

T = - [M(R 2 + s 2 — 2 Rs cos (p) + © 5 ]^ 2 
or 

T= l - (Mr 2 + & s )<p 2 , 

where r is the distance of the center of gravity from the contact line of the cylinder 
with the base. According to Steiner’s theorem, 

©h = Mr 2 + 0, 
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is the moment of inertia about the contact line which changes with time, and (11.14) 
takes the form 

T -2 
T = —cp . 

2 

If we now abbreviate (1 1.14) by 

1 .2 

T = - (B — 2 MRs cos cp)cp z 

and add the potential energy 
U = Mgs (l — cos cp), 
we then get the energy law 


1 9 

-(B — 2MRs cos cp)cp z + Mgs( 1 — cos<p) = E. (11.15) 

The equation differs from that for the physical pendulum (compare the section on 
the physical pendulum). For small angles cp, we obtain 

® u cp 2 + Mgscp 2 = Mgsa 2 , (11.16) 

where we replaced the arbitrary constant E by the arbitrary constant a. Equation 
(11.16) is solved by 

<p = a cos(cot + 5), (11.17) 


with 


2 M S S 

CD = - 


Mgs 

MR 2 + 0 - 2 MRs ' 


(11.18) 


In the limit of a symmetrical mass distribution (s = 0), one has co = 0. If the center of 
gravity moves to the cylinder surface (s — R), then 

2 m 8 r 
CD = —— . 

®s 

If the mass is limited to a more restricted region, cd becomes very large. If we imagine 
that a part of the mass is shifted by an appropriate device to the outside of the rolling 
cylinder (see Fig. 11.13) and that s is large compared to R , then the vibration turns 
into the vibration of a physical pendulum. 



Fig. 11 . 13 . Transition from the 
rolling pendulum to the phys¬ 
ical pendulum 
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EXAMPLE 


11.7 Moments of Inertia of Several Rigid Bodies About Selected Axes 

Figure 11.14 shows the moments of inertia of (a) a disk, (b) a cylinder, (c) a rec¬ 
tangular plate, (d) a spherical shell, (e) a solid sphere, and (f) a cube about different 
selected axes. 


Fig. 11.14. 
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EXERCISE 


11.8 Cube Tilts over the Edge of a Table 


Problem. A cube with the edge length 2 a and mass M glides with constant velocity 
vo on a frictionless plate. At the end of the plate, it bumps against an obstacle and tilts 
over the edge (see Fig. 11.15). Find the minimum velocity t>o for which the cube still 
falls from the plate! 




Fig. 11.15. Cube tilting over 
an edge 


Solution. We look for the velocity vo for which the cube can tilt over its edge, as is 
represented in (c). If it bumps into the obstacle at the edge of the plate, it is set into 
rotation about the axis A. At the time of collision all external forces act along this 
axis, and the angular momentum of the cube is conserved. Before hitting the obstacle, 
the cube has—due to the translational motion—the angular momentum 


L = \rxp\=p-a = Mvoa. (11.19) 

Immediately after the collision, the angular momentum appears as rotational motion 
of the cube 

L = ©a&>0 = Mvoa , 


or 


CVO = 


Mvoa 

~&A~' 


( 11 . 20 ) 


If the cube begins to lift off, the gravitational force causes a torque about the axis A 
that counteracts the lifting process. 

For the kinetic energy of the cube immediately after the collision, one has, for given 

a>o, 


To 




1 M 2 vla 2 


@A 


( 11 . 21 ) 


The potential energy difference between position a and position c is 

A V = M(h 2 -hi)g = M(V2a -a)g = Mag(V2 - 1), (11.22) 


and from the energy conservation law, immediately it follows that 

r 1 M 2 via 2 

Mag(J 2—D = - 0 . 

2 ©A 


(11.23) 
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Exercise 11.8 


The moment of inertia of the cube ©a is easy to calculate: 

2 a 2 a 2 a 



0 0 0 


(,x 2 + y 2 ) dx dy dz = -Ma 2 . 


(11.24) 


From (11.23), it follows that 



and from this, we find 



(11.25) 


This is the correct result. We emphasize this because one could easily come to another 
result by a false consideration : The kinetic energy is (1/2 )Mvq, and from the energy 
conservation combined with (11.22) it follows that 



This leads to 



(11.26) 


i.e., a value that is smaller than the correct result (11.25) by the factor ^3/8. The 
result (11.26) is wrong since the cube loses part of its kinetic energy in the collision 
because of its inelasticity. The correct result (11.25) is based upon the conservation of 
angular momentum, which acts “more strongly” than the conservation of energy. 

EXERCISE 

11.9 Hockey Puck Hits a Bar 

Problem. A thin bar of length / and mass M lies on a frictionless plate (the x , y-plane 
in Fig. 11.16). A hockey puck of mass m and velocity v knocks the bar elastically 
under 90° at the distance d from the center of gravity. After the collision the puck is 
at rest. 

(a) Determine the motion of the bar. 

(b) Calculate the ratio m/M, accounting for the fact that the puck is at rest. 


Fig. 11.16. A hockey puck 
hits a bar on a frictionless 
plate 


2 


2 



X 
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Solution, (a) Since the collision is elastic, momentum and energy conservation hold, Exercise 11.9 

where momentum conservation refers both to linear and angular momentum. The bar 
acquires both a translational and a rotational motion from the collision with the puck. 

Conservation of the linear momentum immediately leads to 


P s = Mv s = mv, 

and the velocity of the center of gravity is 
mv 



Likewise, from the conservation of angular momentum it follows that 
L s = 0 s co = mvd = D, 


(11.27) 


(11.28) 


(11.29) 


and for the angular velocity of the bar relative to the center of gravity, 


mvd 
Ml 2 /12’ 


(11.30) 


where 


1/2 

& s = J gr 2 dV = 2-Ml 2 . 

0 

Thus, the center of gravity of the bar moves uniformly with v s along the y-axis, while 
the bar rotates with the angular velocity co about the center of gravity. Figure 11.17 
illustrates several stages of the motion. 

(b) The kinetic energy of the bar can be determined by means of the energy con¬ 
servation law. Before the collision, the kinetic energy of the puck is 



(D i =i 

Fig. 11.17. The motion of the 
bar 



while the kinetic energy after the collision consists of two components: 
1 9 

T t = -Mv^ “translation energy of the center of gravity” 
and 


(11.31) 


T r = 



“rotation energy about the center of gravity.” 


Since the potential energy remains unchanged, it immediately follows that 


T = l -mv 2 = T t + T r = l -(Mv 2 + & s co 2 ) 


or 


mv 2 = Mv 2 + 


Ml 2 

~V2 


(11.32) 
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Exercise 11.9 


Insertion of (11.28) and (11.30) into (11.32) finally yields 
m 2 v 2 d 2 ( 12) 2 Ml 2 


mv 2 = 


2 2 
m v 


1 = 


M 

m 

M 4 


12 


M 2 / 4 
m d 2 
M T 2 "’ 


12 ’ 


or 


m 1 

M ~ 1 + 12 ( J //) 2 

for the mass ratio. If the puck kicks the bar at the center of gravity, d = 0, no rotation 
appears. In order to make the collision elastic, m — M must be satisfied. 

If the puck kicks the bar at the point d = 1/2, the collision is elastic only if M = 4m. 
In this case the rotation velocity is co = 6mv/Ml = 6v s /l. 


EXERCISE 


11.10 Cue Pushes a Billiard Ball 


Problem. A billiard ball of mass M and radius R is pushed by a cue so that the center 
of gravity of the ball gets the velocity vo. The momentum direction passes through the 
center of gravity. The friction coefficient between table and ball is /i. How far does 
the ball move before the initial gliding motion changes to a pure rolling motion? 


Fig. 11.18. A cue pushes a 
billiard ball 



Solution. Since the momentum direction passes through the center of gravity, the 
angular momentum with respect to the center of gravity at the time t = 0 equals zero. 
The friction force f points opposite to the direction of motion (see Fig. 11.18) and 
causes a torque about the center of gravity 


D s = f • R = iiMgR. 


(11.33) 


The result is an angular acceleration of the ball, so that 

fiMgR fiMgR 5 gig 
0, “ (2/5 )MR 2 “27' 


(11.34) 


Moreover, the friction force causes a deceleration of the center of gravity, i.e., 


Ma s = —f or a s = — 


M 


M 


(11.35) 


a s is the acceleration of the center of gravity. 
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For the rotation velocity of the ball, one gets from (11.34) —after performing the 
integration— 


t 



o 


(11.36) 


The linear velocity of the center of gravity follows from (11.35) —again after 
integration—as 


Vs 


J a s dt = D 0 -ligt. 


(11.37) 


The billiard ball begins rolling when v s = coR, or 

^H^ tR = v 0-gut 
Z K 


(11.38) 


or when 


Exercise 11.10 


7 . . . 2 v 0 

vo=~/*gt and t = - —. 
2 7 tig 


(11.39) 


The distance passed before rolling starts is obtained—by integrating (1 1.37) —as 


i 

/ 


s = / v s dt = vot — 




and with t from (1 1.39) finally as 


2 vi 


1 \xg 2fig \7J 49 fig' 


12 vi 


(11.40) 


(11.41) 


If the ball is kicked at a distance h above the center of gravity, besides the linear 
motion there appears a rotational motion with the angular velocity 


co = 


Mvoh 

0 


5 no h 
2~R^ 


(11.42) 


If h = (2/5)/?, the rolling motion of the ball starts immediately. For h < (2/5)/?, one 
has co < vo/R, and for h > (2/5)/? correspondingly co > no//?; in the second case the 
friction force points forward. 

Figure 11.19 shows the change of v s and coR as a function of time for h = 0. If 
v s = coR , the rolling motion begins, the friction vanishes, and then v s and co remain 
constant. 
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EXERCISE 


11.11 Motion with Constraints 


Problem. A bar of length 21 and mass M is fixed at point A, so that it can rotate only 
in the vertical plane (see Fig. 11 .20). The external force F acts on the center of gravity. 
Calculate the reaction force F r at the point A! 


Fig. 11.20. The bar rotates 
about the point A 


■«- 21 - 

i k. 1 

i 

r 

*- / -► 

1 

| 1 

1 ■ 

| 1 

0 1 

5 ) 


F 


Solution. In order to determine F r , one calculates the torque Da with respect to the 
center of gravity of the bar, caused by F r . 

The torque with respect to the fixed point A is 


Da = —FI = F) a co, 


(11.43) 


since the constraints do not contribute to Da . The angular acceleration of the bar co 
then follows from (1 1.43): 



FI 

©? 


(11.44) 


where ©a is the moment of inertia of the bar with respect to A. Since the moment of 
inertia & s with respect to the center of gravity S is easily calculated as 


/ 



-/ 


© 


(11.45) 
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one immediately gets for ©a by means of Steiner’s theorem 

& A = @ s +Ml 2 = ^Ml 2 + Ml 2 = ^Ml 2 . (11.46) 

Equation (11.46) inserted into (11.44) leads to 


FI _ 3 F 

©I 


(11.47) 


Since (11.47) must be correct, independent of the point from which the torque is being 
calculated, from the knowledge of the torque with respect to the center of gravity S , 


D s = —F r l, 


(11.48) 


and hence of the angular acceleration 

. _ D s _ 3F r l _ 3 F r 

c °~ 


(11.49) 


one can calculate the reaction force F r , by comparing (11.47) and (11.49): 


3 F 3F r 1 

_ __ _v p — F 

4 Ml ~ Ml r ~ 4 


EXERCISE 


11.12 Bar Vibrates on Springs 


Problem. 

(a) Find the moment of inertia of a thin homogeneous bar of length L with respect to 
an axis perpendicular to the bar. 

(b) A homogeneous bar of length L and mass m is supported at the ends by identical 
springs (spring constant k). The bar is moved at one end by a small displacement 
a and then released. 

Solve the equation of motion and determine the normal frequencies and normal 
vibrations. Sketch the normal vibrations. 


in, L 



Fig. 11.21. A bar is supported 
by two identical springs 


Solution, (a) If the bar is divided into small segments of length dx with the cross 
section /, we have elementary volumes dV = f dx. Let q be the constant density of 
the bar; then we have 
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Fig. 11.22. 


y“ 

f dx 
A ® N) 

L 


x 



L 

= Qf J x 2 dx = ^QfL 3 - 

o 


Since m = q/L is the total mass of the bar, it follows that 

© A = -mL 1 . 

3 

According to Steiner’s theorem, the moment of inertia about an axis through the 
center of gravity is 

®a = ®s+m(— ^ => ®s = — mL 2 . 


(b) Let b be the length of the spring before the motion (b is not the natural length 
of the spring, because of the existence of the gravitation field), and x\,X 2 ,x be the 
lengths of the first and second spring, and the height of the center of gravity of the bar 
at the time t. Since the bar is rigid, we have x\ + X 2 = 2x. Newton’s second law leads 
to 


mx = —k(x\ — b) — k{x 2 — b) 


or 


mx = —k{x\ + X 2 ) + 2 kb. 

The constraint condition leads to 

2k 

mx = —2 kx + 2 kb =>- x =- (x — b). (11.50) 

m 

We assume that there are only small displacements, so that sin ft ~ ft. Then 
L L 

X2 = x + —d, X\ = x — 

For the torque, we get 

k 1 2 

0# = — L(x 2 —xi) = —kL d, since X 2 — x\ = Ld. 

2 2 



Fig. 11.23. 
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From (a) © = (1/12)mL 2 , we conclude 
6k 

ft = - ft. 

m 

The solutions of (11.50) and (11.51) are 
x = A cos(co\t + B) + b 


and 

ft = C cos(&>2^ + D) 
with 


2k 6k 

qj\ = J — and 0)2 = J —. 

V m V m 

The initial conditions at the time t = 0 are 


x — b -, 

2 


a 

ft = ~, 
L 


x = 0. 


ft =0. 


Thus follows 


B = D = 0, 

A = -~, 

2 

a 

C =L’ 


b — - = Acos(B) + b, 
0 = —Aoj\ sin (B), 


— = C cos (D), 

0 = —Ccl >2 sin (D), 


and, hence, 


a 2k 

x = b -cos A / — t, 

2 V m 


a 6k 
ft = — cos J — t. 
L V m 


The normal modes are 


X\ = x\ +X 2 = 2b — a cos J — t, 

m 


/ 6k 

X 2 = x\ — X 2 = —a cos J — t . 

V m 


(11.51) 



X x = 0 => x 2 = -x x X 2 = 0 => x 2 = x 


Fig. 11.24. The normal vibra¬ 
tions 
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The general motion of a rigid body can be described as a translation and a rotation 
about a point of the body. This is just the content of Chasles’ theorem, discussed at the 
begin of Chap. 4. If the origin of the body-fixed coordinate system is set at the center 
of gravity of the body, one can separate the center-of-mass motion and the rotation 
in all practical cases (compare Chap. 6, equations (6.4)—(6.8)). For this reason, the 
rotation of a rigid body about a fixed point is of particular significance. 


12.1 Tensor of Inertia 

We first consider the angular momentum of a rigid body that rotates with angular 
velocity oo about the fixed point 0 (see Fig. 12.1): 

L = ^m v (r v x v v ) 




V 


the latter relation holds according to the expansion rule. We decompose r v and oo into 
components and insert 



Fig. 12.1. A rigid body rotates 
with oo about the fixed point 0 


^ = ^m v ((xl + yl + z 2 v )((o x , co y ,(o z ) - (x v co x + y v co y + z v m z )(x v , y v , z v ))- 


V 


Ordering by components leads to 



V 


+ (C*v + yl + zl)co y - yl(Oy - x v y v a> x - z v y v co z )e y 
+ (Ov + yl + ~ x v z v o) x - y v z v co y )e z ). 
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12 Rotation About a Point 



Fig. 12.2. The angular momen¬ 
tum is in general not parallel 
to the angular velocity oo 



Fig. 12.3. Possible rotation 
points and coordinate systems 
of the rigid body 


For the components of the angular momentum one thus obtains 




L x — ( ^ ' m v (y v + z v ) jco x + f ^ ' m v x v y v jcDy + f ^ ' m v x v z 

' V ' ' v ' ' y 

Ly = (~y2 m v x vyv + [y^mvixl + yl)\a>y + ( - 'Yh mvyvZv F z ’ 

l z = {-'y2 mvXvZv ) c °x + [-'%2 m vyvZv) M y + ( Yl mv ^ x v + yln M z- 

\y / \y / \y / 

The individual sums are abbreviated by 


L X - ® XX CO X + (-) X yCOy + ® xz co z , 

Ly — ©VX*^X + © -y-y COy ^)y Z CO Z , 

L z = H“ ©zv^V ®zz<^Z’ 


or 


L^ = 2_^© M v^v, 

V 

or, written in terms of vectors (matrix notation), 

L = 0 • oo. 

The quantities ©^y are the elements of the tensor of inertia © that can be written as a 
3 x 3-matrix: 

/©xx ©XV ©xz\ 

© = ©yx ©yy ©yz • 

\ ®zx ©zy ©zz / 

The elements in the main diagonal are called moments of inertia , the remaining 

ones are called deviation moments. The matrix is symmetric , i.e., © v/x = ©^y. Thus 
the tensor of inertia has 6 independent components. If the mass is continuously distrib¬ 
uted, one changes from summation to integration for calculating the matrix elements. 
For example, 


©xv = 


j Q(r)xydV, 
v 


© 


XX 


f e(r)(y 2 + z 2 )dV, 
V 


where q (r) is the space-dependent density. 

At each point Oo, Oi, O 2 ,... the tensor of inertia ©^y is different. At a fixed point 0 
©^v also depends on the coordinate system. 

As follows from their definition, the ©^y are constants if one selects a body-fixed 
coordinate system. The tensor of inertia is however dependent on the position of the 
coordinate system relative to the body and will change if the origin is shifted or the 
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orientation of the axes is changed. The tensor of inertia is usually understood as the 
tensor in a coordinate system with the origin in the center of gravity (center-of-mass 
system). The corresponding principal moments of inertia (see Fig. 12.3) are corre¬ 
spondingly the moments of inertia. 


12.2 Kinetic Energy of a Rotating Rigid Body 

Quite generally the kinetic energy of a system of mass points is 
T = 

V 

We decompose the motion of the rigid body into the translation of a point and the 
rotation about this point, so that v v = V + co x r v , and we obtain 

T = ^ ^m y (V + (o x r v ) 2 

V 

= ^ mv 2 + v (w x 

' V 

The first and the last term correspond to pure translational and rotational energy, re¬ 
spectively. The mixed term can be made to vanish in two different ways. 

If one point is fixed, and if we put it at the origin of the body-fixed coordinate 
system, then V = 0. Otherwise the origin is put at the center of gravity, so that 


m v r v j + ]- ^2m v ((o x r v ) 2 . 


J> v r v — 0. 

V 

The rotation point is in this case the center of gravity. We now consider the pure 
rotation energy 

1 ^ 

T = - 2_^m v (o) x r v ) • (co x r v ) = - ^m v co • (r v x (wx r v )) 

V V 

= ift) • y^my(r„ X Vy) = \-(o ■ ^r v x p v = ]-co- y~%. 


Hence, 


T = -(o L. 
2 


We can substitute the angular momentum = J2v ®/zv&>v (/x, v = 1, 2, 3): 

T = —(& • L = — ^ ' cOfx y ' = — y ' ®i lv CD/jCDy . (12.1a) 


/X V /X,V 

Because ©^y = © v/x , the sum reads 

T — — (® XX CD X + ©jjUXy + ©zz<X> z T 2© X jCU x CDy + 2© xz 6Q x + 20y Z fr>y<x> z ) . 


(12.1b) 
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Using tensor notation, the rotation energy reads 


1 T ^ 

T = -co • © • co. 
2 


(12.1c) 


The vector co on the right-hand side of the tensor © must be given as a column vector, 
and on the left-hand side as a row vector: 



(12.Id) 


12.3 The Principal Axes of Inertia 

The elements of the tensor of inertia depend on the position of the origin and on the 
orientation of the (body-fixed) coordinate system. It is now possible for a fixed origin 
to orient the coordinate system in such a way that the deviation moments vanish. Such 
a special coordinate system is called a system of principal axes. The tensor of inertia 
then has diagonal form with respect to this system of axes: 



( 12 . 2 ) 


For angular momenta and rotation energy in the system of principal axes , we have 
the especially simple relations (co v are the components of the angular velocity co with 
respect to the principal axes) 



(12.3) 


V 


V 



(12.4a) 


/X 

or written out, 


/X 



(12.4b) 


Because of the tensorial relation L = Geo, the angular momentum and the angular 


velocity have different orientations. 

If the body rotates about one of the principal axes of inertia, e.g. about the /x-axis, 
co = coe then (because in this example co = coe^) according to (12.3) the angular 
momentum L and the angular velocity co have the same orientation. The vector co then 
has only one component, co = (0, 002 , 0), if the rotation is about the second principal 
axis. The same holds also for the angular momentum: L = (0, L 2 , 0). This property 
of parallelism between the angular momentum and the angular velocity allows one to 
determine the principal axes. The question is namely how to choose co = {co\ , 602 , 603 } 
(about which axis must the body rotate), in order to get the angular momentum 
L = ©a) and the angular velocity parallel to each other, i.e., L = Geo, with © a scalar. 
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From the combination of the relations L = 0cu (0 is a tensor) and L = 0w (0 is a 
scalar), we obtain the equation 

L = 0.<u = 0<y, (12.5) 

which is an eigenvalue equation. In this equation, the scalar 0 and the related compo¬ 
nents co x ,co y ,co z , i.e., the rotation axis, are unknown. The equation physically states 
that the angular momentum L and the rotation velocity co are parallel to each other. 
This is fulfilled for certain directions co that—as stated above—must be determined. 
All values 0 that satisfy (12.5) are called eigenvalues of the tensor 0; the correspond¬ 
ing vectors co / 0 are eigenvectors. 

Equation (12.5) is a shortened notation for the system of equations 

®xx<*>x + ®xyMy + = ©<«z> 

®yX&*X + QyyCOy T~ &y Z CO z = @COy , (12.6) 

®ZX M X T ® Z y(L)y + ©zZ^Z = 0 CO z , 

or 

(©xx — ©)^x H” ©xy^y ©xz^z = 0, 

©yx^x H - (®yy ~~ ©)&>y + ®y Z CO z = 0, (12.7) 

©zx*>x + ® Z yCQ y + (©zz - @)CD Z = 0. 

This system of homogeneous linear equations has nontrivial solutions if its determi¬ 
nant of coefficients vanishes: 


®xx~® 

&xy 

©xz 



&yx 

0 

v 

1 

0 

©yz 

= 0. 

(12.8) 

®zx 

©zy 

0 

fNl 

l 

© 




The expansion of the determinant leads to an equation of third order in 0, the char¬ 
acteristic equation. Its three roots are the desired principal moments of inertia (eigen¬ 
values) 0i, 02, and 03 . By inserting 0/ into the system of (12.5), one can calculate 
the ratio co x ^ : aty : co z ^ of the components of the vector co^ l \ Thereby the orientation 
of the i th principal axis is determined. 

Since one can find a tensor of inertia for any possible position of the body-fixed 
coordinate system, there exists also a system of principal axes at each point of the 
body. The orientations of these axes will however not coincide in general. 


12.4 Existence and Orthogonality of the Principal Axes 

In principle, it would be possible for the cubic equation (12.8) to have two complex 
solutions. We therefore have to prove that a system of real orthogonal principal axes 
generally exists. 

In order to apply a shortened summation notation, we number the coordinates 
(x = 1, y = 2, z = 3) and denote them by Latin letters. Greek letters are indices for 
the three different eigenvalues. We multiply the eigenvalue equation (12.5) for 0^ by 
the complex conjugated of cof^ and sum over i. 



Fig. 12.4. Special case: If co 
is parallel to a principal axis, 
then L is parallel to co 



Fig. 12.5. General case: The 
angular momentum L is not 
parallel to the rotation veloc¬ 
ity co 
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The equation for the component i reads 


k 

(12.9) 

This leads to 


E ®ikO)f ) 0)^* = ©A = ©A 60 {X) • 60 ( ^*. 

i,k i 

(12.10) 

In the same way, we form the complex conjugated of the equation corresponding to 
(12.9) for 0^, multiply by co^\ and sum over k : 

\ A rr\ , Xd) rr\ , Sd) \ ' , Xl 1 )* Gl\*sSt L )* 

2_^WkiO)i -®nO) k , -V^k ’ 

i i 

(12.11) 

E 

i,k k 

(12.12) 


Now we utilize the property of the tensor of inertia to be real and symmetric. We have 
= ©^ = ©|., and the left-hand sides of (12.10) and (12.12) are equal to each 
other. We subtract (12.12) from (12.10): 

(©a - ©* )« (A) • « (M) * = 0. (12.13) 

This equation allows two conclusions: 

(1) Setting X = /jl, then for the eigenvalues of 

(0; - &?)<o a) ■ (0 a> * = 0 (12.14) 

follows the relation ©^ = ©*, since the scalar product of two complex conjugated 
quantities is positively definite. 

We thus proved that ©^ is real. Hence, any body always has three real principal 
moments of inertia and therefore also three real principal axes . This is of 
course physically clear from the outset, since the principal moments of inertia are 
nothing else but the moments of inertia about the principal axes, and therefore 
they are always real. 

(2) We now consider the case A^/x: Since all © v and therefore also all co v are real, 
(12.13) reads 

(©i - 0^)® W • = 0. (12.15) 

(a) If ©a / 0 /x , then bo ik) ■ « (,<l = 0, and therefore, m ,/ } and « (/l) are orthogonal. 

(b) If, e.g., ©i = ©2 = ©, i.e., if two of the three eigenvalues are equal, then 
besides a/ 1 ) and co ^ all linear combinations of these two vectors are eigen¬ 
vectors, too: 

@■^ = @(0^, 0.®® = 0®® 

=> © • (aw (1) + /3co^) = @(aw (1 ^ + /3co^). 
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Thus, we can arbitrarily select two orthogonal vectors from the plane spanned 
this way and consider them as directions of principal axes. The third principal 
axis is by (12.15) fixed orthogonally to the two other axes. If two principal 
moments of inertia with respect to the center of gravity as rotation point are 
equal, the body is called a symmetric top. 

(c) If all three moments of inertia are equal (©i = © 2 = © 3 ), then any arbitrary 
orthogonal set of axes is a system of principal axes. If this holds with respect 
to the center of gravity, the body is called a spherical top. 

If a body has rotational symmetry about one axis, then we are dealing with case (b), 
and the rotation axis is a principal axis. For other kinds of symmetries the symmetry 
axis also coincides with the principal axis. 


EXAMPLE 


12.1 Tensor of Inertia of a Square Covered with Mass 


We calculate the tensor of inertia and the principal axes of inertia of a square covered 
with mass for a comer of the square. We put the square in the x, y-plane of the coor¬ 
dinate system, as is shown in Fig. 12.6. The components of the tensor of inertia are 
obtained with z = 0 by integration over the area: 


©rr = 


©_y_y = G 


-•II 

= 0*=0 

a a 

// 


y 2 dxdy = ^Ma 2 , 


9 1 9 

v dxdy = -Me/ , 


>-= 0^=0 
a a 


© 

7 


-II 


(.x 2 + y 2 ) dx dy = -Ma 2 . 



Fig. 12.6. The angular veloc¬ 
ity co is arbitrary; however, it 
passes through the coordinate 
origin 


_y=0;t=0 


Likewise, 


© — © 


a a 

-II 


xy dx dy = — Ma 2 . 
4 


_y=0x=0 


The remaining deviation moments contain the factor z in the integrand and therefore 
vanish: 


©j Z — © Z j — © xz — © Z x 


= 0 . 


Thus, in the selected coordinate system the plate has the following tensor of inertia: 
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Example 12.1 




1 ? 

— Ma 2 
4 

1 9 

- Ma 2 
3 


\ 0 0 


0 


\ 


0 



We now calculate the orientations of the principal axes. 

In accordance with the described approach, we first determine the eigenvalues of 
the tensor of inertia. We introduce the abbreviation ©o = Ma 2 . Then we have the 
determinant 


1 


-©0 — © 
1 

-4 00 

0 


- 0 o 


-©o - 0 


2 

-0o - © 


or 


02 -5 e ° e + T5 e » 


5 e °- e 


: 0 . 


The roots of this characteristic equation 

©l = T2® 0 ’ ® 2= 12 ® 0, ®3 = -®o 

are the principal moments of inertia with respect to the origin. 

For the principal moment of inertia © v , the orientation of the axis co^ results from 
the eigenvalue equation ©o/ y ) = © v a/ y \ 

Written out for v = 1, 

/ 1 1 
3 00 -; 00 

-5 00 ^ 0o 

v 0 0 

By multiplying out, we get a vector equation; after splitting into the three compo¬ 
nents, we obtain the three equations 

100^ - i©0< = T©o 4V 
-l0o^) + l© o < = l0 o ^), 


3®°/ 


/77\ 


'X 
(!) 
y 

(i) 

'z 


W7 


/ 77 \ 


T 2 00 


’x 

(1) 

'y 

(i) 

’z 


V7‘7 


From this, it follows that 
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and thus, the orientation of the first principal axis is 


e l 


|®(D| 



Example 12.1 



Fig. 12.7. The principal axes 
for rotations about the point 0 


Analogously, we obtain for the two other directions 


00 


( 2 ) 


o(2)| 



and 


oo 


(3) 


>(3)| 



Evidently, the principal axes are orthogonal to each other, as is demanded by the gen¬ 
eral theory. For a rotation about the point 0 around one of the principal axes, the 
angular momentum L is parallel to oo , but in general the center of gravity then also 
moves. Such a motion can be forced only by the action of a force. Thus it is no free 
motion. Force-free rotations (shortly: free rotation) take place only about the center of 
gravity. The principal axes moments or principal moments of inertia about the center 
of gravity are the principal moments of inertia or principal axes of the body. In our 
example the orientations of the principal axes coincide with those at the point 0. 


12.5 Transformation of the Tensor of Inertia 

We investigate how the elements of the tensor © behave under a rotation of the coor¬ 
dinate system. The transformation of a vector under rotation of the coordinate system 
is described by 1 

x' = Ax 

or 

x[ = ^ atjXj , (12.16) 

j 

or for the basis vectors 

e;=E a '7 e 7’ (12-17) 

j 


1 See W. Greiner: Classical Mechanics: Point Particles and Relativity, 1st ed., Springer, Berlin 
(2004), Chapter 6. 
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where the components aij of the rotation matrix A are the direction cosines between 
the rotated and the old axes. The inverse of this transformation reads 


x = A V or x i ='Y^a ji x' j . 


(12.18) 


The inverse rotation matrix {a l )ij = (aji) is found by exchanging rows and columns 
(transposition), since the rotation is an orthogonal transformation which satisfies 


^ ' atj^kj — $ik or ^ ' &ijCiik — 8jk . 


(12.19) 


We require for the tensor of inertia that a vector equation of the form 

Lk =y 

/ 

exists also in the rotated system: 

*■:•=! y,A- 


( 12 . 20 ) 


( 12 . 21 ) 


Thus, we can determine the transformation behavior of the tensor from the behavior 
of the vectors. The vectors L and co obey the transformation equation (12.18). If we 
replace L k and coi in (12.20) by the primed quantities, we obtain 


r. a ji c °j )—* 

/ v j 7 j 

Multiplication by at k and summation over k yields 

y^f y ^ a ikajl®kl ju>j = Y ( Y a jk aik j L j = Y^ij^j — ^i- 

j ^k,l 7 j V k 7 i 

For the components of ©' follows by comparison with (12.21) 

®ij = Ys a ik a jl®kl- 


( 12 . 22 ) 


(12.23) 


k,i 


This transformation relation is the reason for denoting © as a “tensor.” A tensor of 
rank m is generally defined as any quantity which under orthogonal transformations 
behaves according to the logical extension of (12.23) (summation over m indices), 
e.g., a tensor of third rank 


^ijk — ^ ' kin' a jj'Okk’ Af j'k’ • 
i'J'X 


(12.24) 


© is a tensor of second rank; a vector can because of (12.16) be considered as tensor 
of first rank, a scalar accordingly as tensor of rank 0. One can easily memorize the 
transformation law of a tensor: Each component of the tensor transforms as a vector 
(see (12.16)). 
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For the tensor of inertia, (12.23) can be more clearly represented in matrix nota¬ 
tion: 

©' = A©A _1 . (12.25) 

This is a similarity transformation. 

The matrices A (A -1 ) reduce to row vectors (column vectors) if we want to deter¬ 
mine only the moment of inertia ©• • about a given axis ej from the tensor of inertia 
©£/ in the coordinate system e&. According to (12.23), the desired moment of inertia 
&n is 

& 'u = y^aijau&ji. 

JJ 


Now, according to (12.17), e- = {an, a^, a^} is the vector e- in the basis e ; . Hence, 
the moment of inertia about the rotation axis e- = n = (n\,n 2 ,nf) can obviously be 
written as follows: 


— 'y ' a ij ® jl a il — y ' a ij ® jl )li — ^ ' n j ©y//2/ 

j,l jJ j,l 

/ w l\ 

= (n\,n 2 , ^ 3 )© I n 2 I = n 7 • © • n 

\n 3 l 


= ^2 ®ij n i n j • (12.26) 

iJ 

This relation will be derived more clearly in the context of the subsequent equation 
(12.33). It allows one to calculate the moment of inertia about an arbitrary rotation 
axis n rather quickly. 


12.6 Tensor of Inertia in the System of Principal Axes 

If the three orientations of the principal axes ej = are selected as coordinate axes, 
then 

e- = eo^e 1 +&>2 )e 2 + 0)3^3 = y ^co^pej. 

j 

A comparison with (12.17) shows that in this case 

(i) 

Hence, according to (12.23) the tensor of inertia in the system of principal axes reads 

®’ij = J2 aika J i&u = 534'S O)0 « = Y2 (o k ) (H Bkico i J> 

k,i k,i k v / 


(12.27) 
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Since co^ is an eigenvector of the matrix © with the eigenvalue © ; -, according to 
(12.5), we have 

0w W) = (12.28) 

or, explicitly, 

i 

Therefore, (12.27) turns into 


@ 'u =E 


, SO (T\ 0 ) 

0J k & j M k 


= ©;E 


= 


^.0,0-) 


= ®A’- 


(12.29) 


We thus used the orthonormality (12.16) of the principal axes vectors co^ l \ The co^ 
were assumed to be normalized, which is possible because of the linearity of the eigen¬ 
value equation (12.28) with respect to oo . Equation (12.29) expresses the interesting 
and important fact that the tensor of inertia in its eigenrepresentation (i.e., in the coor¬ 
dinate system with the principal axes <o^ as coordinate axes) is diagonal and exactly 
of the form (12.2). This was to be expected, but it is satisfactory to see how everything 
fits together consistently. 


12.7 Ellipsoid of Inertia 



Fig. 12.8. n characterizes the 
rotation axis 


We define a rotation axis by the unit vector n with the direction cosines n = 
(cosa, cos/3, cos y). According to (12.26), the moment of inertia © about this axis 
is 


© = © n = (cosa, cos cos y) 


'®x 

© 

0, 




®xy ®xz \ / COS Oi \ 

®yy ®;yz I I cos^ I 
®zz / \cos y J 


Multiplying out, we obtain 


© n = © XJC cos 2 a + G yy cos 2 P + © zz cos 2 y 

+ 2G xy cos a cos P + 2® xz cos a cos y + 2©^ cos P cos y. (12.30) 


By defining a vector Q n = n/V®n, we can rewrite the equation as 

®yyQy + ®zzQz ^®xyQxQy 2® xz QxQz T 2(dy Z QyQ z — 1- (12.31) 


This equation represents an ellipsoid in the coordinates ( q x , Q y , q z ), the so-called el¬ 
lipsoid of inertia. 

The distance q from the center of rotation 0 along the direction n to the ellipsoid of 
inertia is £ = 1 /\/©. This allows us to write down at once the moment of inertia if the 
ellipsoid of inertia is known. Each ellipsoid can now be brought to its normal form by 
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Fig. 12.9. The ellipsoid of in¬ 
ertia 


a rotation of the coordinate system, i.e., the mixed terms can be made to vanish. We 
then obtain the form of the ellipsoid of inertia 


©l£?l + ®2£?2 + ®3£?3 — 1- 


(12.32) 


This transformation of the ellipsoid of inertia obviously corresponds to the transforma¬ 
tion of the tensor of inertia to principal axes. This becomes clear by comparing (12.31) 
and (12.32) with (12.1b) and (12.4a). The principal moments of inertia are given by 
the squares of the reciprocal axis lengths of the ellipsoid. For two equal principal mo¬ 
ments of inertia, the ellipsoid of inertia is a rotation ellipsoid, for three equal moments 
a sphere. 

There is also a physical approach to the ellipsoid of inertia which will be presented 
now. Let n = {cos a, cos /3, cos y} be a unit vector pointing along the direction of the 
angular velocity co , so that 

(o = con = oleosa, cos/3, cos y} = co{n i, ri2, n 3} = {aq, 002, ^ 3 }- 

For the kinetic rotation energy, we then obtain according to (12.1a) 




© n denotes the moment of inertia about the axis n. Hence, the moment of inertia about 
an axis with the orientation n is given by 


© n = © 11 cos 2 a + ©22 cos 2 /3 + ©33 cos 2 y 

+ 2©12 cos a cos /3 + 2©i3 cos a cos y + 2©23 cos f$ cos y. 


This agrees with the already known result (12.30). With the coordinates q = 
n/\/©^ = (£ 1 , Q2, £ 3 ), we thus obtain the ellipsoid of inertia 


©ll£l + ©22£>2 + ®33£>3 + 2©i2£l£2 + 2©i3£i£3 + 2©23£2£3 = 1. (12.33) 


The radius of the ellipsoid in the direction n is £ n = 1 / v^©n • 
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Fig. 12.10. The rigid body ro¬ 
tates about the axis n: d v is the 
distance of the mass m v from 
the rotation axis 



Finally, there is still a third approach to the ellipsoid of inertia: According to 
Fig. 12.10, the moment of inertia about the axis n is given by 

©n = Y2, m v d v = ^'Wvli’v x n| 2 . (12.34) 

V V 

We check 


ei 

e2 

e3 

Xy 

Tv 

Zv 

cosa 

COS P 

cos y 


= (jv cosy -z v cos/3)ei + (z v cosa — x v cosy)e 2 
+ (x v cos P — y v cos of)e 3 


dl = |r„ x n| 2 

= (y v cosy — z v cos ^) 2 + (z v cosa — x v cosy ) 2 + (x v cos/J — y v cosa ) 2 
= (yl + Zv) c °s 2 a + (x 2 + z^) cos 2 fi + (x 2 + y 2 ) cos 2 y 

— 2x v y v cos a cos ft — 2x v z v cos a cos y — 2 y v z v cos ft cos y. (12.35) 

Inserting this into (12.34) immediately yields 


®„ = J2 @ U n i n j’ 
ij 


(12.36) 


i.e., again the known ellipsoid of inertia. 

One should realize at this point that the ellipsoid of inertia for a given tensor of 
inertia ©;£ can immediately be written down and drawn according to (12.31). We 
use this method of evaluating moments of inertia in an arbitrary direction in Exer¬ 
cise 12.4. 
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EXAMPLE 


12.2 Transformation of the Tensor of Inertia of a Square Covered with Mass 


The tensor of inertia of the square covered with mass in the x, y -plane was given by 
(compare Example 12.1) 


1 

1 


-©o 

T ©0 

0 

3 

4 


— ®o 

i©o 

0 

4 

3 


0 

0 

^00 


The rotation of the coordinate system by <p = 7r/4 about the z-axis must bring ©' to 
diagonal form, because the angle bisectors of the x, y-plane, as was shown (compare 
Exercise 12.1), are principal axes. The corresponding rotation matrix reads 


( COS (p 

— sin<p 

0 


sirup 
cos <p 
0 


V2 

V2 

2 

2 

V2 

V2 

2 

2 

0 

0 


0 

1 / 


Obviously, 

A~ i =r. 


Performing the matrix multiplication yields in accordance with the former result 


©' = A0A -1 


/ 1 \ 
I2 @ » ° ° 

0 0 

^ 0 0 -©o J 


EXERCISE 


12.3 Rolling Circular Top 

Problem. Find the kinetic energy of a homogeneous circular top (density q, mass m, 
height h, vertex angle 2a), 

(a) rolling on a plane, and 

(b) whose base circle rolls on a plane while its longitudinal axis is parallel to the plane 
and the vertex is fixed at a point. 

Solution. For the calculation of the tensor of inertia, we choose the coordinate system 
so that the longitudinal axis coincides with the z-axis. 
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12 Rotation About a Point 



Fig. 12.11. From the figure it 
is seen that m = (1/3)7 rhR 2 Q, 
R = hi ana, s = R /sin a 


Obviously, 


©x 


= e j^ + Z^)dV = e j f f 


( r 2 sin 2 cp + z 2 )r Jz Jr dcp 


v 

2 7T 


= £ 


I dip I r dr I 
0 0 h(r/R) 


(r 2 sin 2 <p + z 2 ) dz 


TV 


= p—hR 2 (R 2 +4h z ) 
zO 


©xx = — m A 2 (tan 2 a + 4). 


For reasons of symmetry, we have 


®yy — ©xx- 

Likewise, 


(12.37) 


■/ 


©zz =Q / (v 2 + y 2 ) dV = q 


Iff 


r 5 dzdr dcp 


2n R h 



0 0 h{r/R) 


dz = y^qHR 4 , 


© 77 = — mil 2 tan 2 a. 
zz 1Q 


(12.38) 


Since the integrals over cp of xy = r 2 cos<psin<p, xz = rzcoscp, yz = rz sirup with 
the limits 0 and 2n vanish, follows ©^ = © X7 = @ yz = 0. The adopted system is a 
system of principal axes. We therefore set ©i = ©** = © 2 , ©3 = ©zz- 
(a) The kinetic energy in the representation of principal axes reads 


r=i©ia>2 + l© 2W 2+l© 3W 2. (12.39) 

Since we already know the principal axes of inertia and moments of inertia, it remains 
only to express the motion of the cone by the corresponding angular velocities. The 
momentary rotation of the cone happens with the angular velocity 00 about a line of 
support. We can express 00 by <p by considering the velocity of the point B . On the one 
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hand vb = cph cos a, and on the other hand vb = co • R cos a. From this, we find 
a> = <p^- (12.40) 

K 

cp is the polar angle of the figure axis (or, equivalently, the tangential line) in the x',y'~ 
plane; cp is the corresponding angular velocity. 

A decomposition of co in the system of principal axes, where co lies in the v,z-plane, 
leads to = 0 and 


o >3 = co cos a and oo\ = sin a. 

For the kinetic energy, we thus obtain from (12.39) 

T = -©1 Co\ + — © 2^2 + -©3<^3 

13 13 

=- mh 2 ( tan 2 a + 4 )ojo H- mh 2 tan 2 a 2 

2 20 V 7 1 210 

3 3 

= — mh 2 co 2 sin 2 a (tan 2 a + 4) H- mh 2 co 2 sin 2 a 

40 20 


3 , 22 / sin4 a ^ . 2 

= —mh co —»-b 6 sin a 

40 V cos z a 


(12.41) 


(12.42) 


If we replace co by (12.40) and employ R/h = tana = sin a/ cos a, then 


T = — mh 2 (h 2 
40 r 


h l (R l 


R i 


h 2 


sin z a + 6cos z a—=- I = — mh 2 (p 2 { 1 + 5cos z a). (12.43) 


40 


(b) The momentary rotation axis co is again the connecting line between the fixed 
vertex and the point of support. The relation between co and <p is likewise obtained by 
considering the velocity of point A. 


Exercise 12.3 



Fig. 12.13. x', y', 2! labels the 
laboratory system, x,y,z the 
system of principal axes 


We have va = h • (p = coR cos a, from which it follows that co = ip I sin a. The pro¬ 
jection of co onto the principal axes yields 


co\ = co sin a = cp , 


C 02 = 0, 


( 12 . 44 ) 
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12 Rotation About a Point 


Fig. 12.14. Cone rolling on 
the edge of its base 



Hence, for the kinetic energy, it follows from (12.39) that 

13 13 

T =- mh 2 ( tan 2 a +4 )coi H- mh 2 tan 2 a col 

2 20 1 2 10 3 

3 , 2 - i ( r2 , „^ 2 h2 \ 3 , 2 - 2/0 

40 ^ \h 2 h 2 R 2 ) 40 * \ h 2 


(12.45) 


EXERCISE 


12.4 Ellipsoid of Inertia of a Quadratic Disk 


Problem. Determine the ellipsoid of inertia for the rotation of a quadratic disk about 
the origin, as described in Example 12.1. Find the moments of inertia of the disk for 
rotation about (a) the x-axis, (b) the y-axis, (c) the z-axis, (d) the three principal axes, 
and (e) the axis {cos 45°, cos 45°, cos 45°}. 

Solution. The ellipsoid of inertia reads 


©o 2 ©o 

3 e>* 2 exQy + 3 


2 . 20Q 2 , 

Qy + ^~Q Z = 1 - 


(12.46) 


(a) For rotation about the x-axis n = {1,0,0}, and thus q = {1 /a/ 07< 0,0}. Inser¬ 
tion into (12.46) yields 


©0 1 
T' 


0 


x — 


©0 
3 ’ 


as expected. 

(b) Here, n = {0, 1,0}, and following the procedure in (a), we find 



(c) Here, n = {0, 0,1}, and following the procedure in (a), we find 



(d) The third principal axis is identical with the z-axis, which corresponds to (c). 
The first two principal axes are given by 


1 1 

Tl’T !’ 0 


, 1 1 

and n 2 = { - —,—=,0 

V2 V2 


n i = 
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respectively. Therefore, 
1 

Q l = 


^/©i [ V20T \/2©7 


,0 


and 
Q2 = 


n2 


1 


1 


V©2 l V202 ’ ^202’ 


Insertion into (12.46) yields 
©0 1 ©0 1 ©0 1 


3 2©i 2 2©i 3 2©i 


and 


©0 


1 + ©o 


1 ©0 


1 


3 2© 2 2 2© 2 3 2© 2 


+ 0=1 


+ 0=1 


©1 = -f 


©0 

12 ’ 


©2 = 12 ® 0 ' 


These are the principal moments of inertia, as was expected. 

(e) In this case, n is proportional to {cos 45°, cos 45°, cos 45°}. Thus, 


n = 


1 1 


1 


V3’ W V3 

and therefore, 

1 

Q = 


v'e l \/30 ’ \/30 ’ \/3© J 


Insertion into (12.46) yields 

3 3© 2 3© + 3 3© + 3 °3© ” ’ 

from which we find 
10 

© = — © 0 . 

36 

This problem demonstrates the simple handling and the usefulness of the ellipsoid of 
inertia. 


EXERCISE 


12.5 Symmetry Axis as a Principal Axis 


Problem. Demonstrate that an ft-fold rotational symmetry axis is a principal axis of 
inertia, and that in the case ft > 3, the two other principal axes can be freely chosen in 
the plane perpendicular to the first axis. 

Solution. If a body has an ft-fold symmetry axis, then the tensor of inertia must be 
equal in two coordinate systems rotated from each other by (p = 2 tt/ n\ 

© = ©' = A ©A -1 . 


Exercise 12.4 
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12 Rotation About a Point 


Exercise 12.5 


If we select the z-axis as a rotation axis, the rotation matrix reads 


( COS (p 

— sin (p 
0 


sirup 
cos <p 
0 



Multiplying the matrices out, one obtains the components 0C of the new tensor of 
inertia which shall coincide with 0/ ; -. 

0 r n = 0 n = 0 n cos 2 ^ + 022 sin 1 cp + 2012 sin p cos p, 

022 = 022 = 0n sin 2 p + 022 cos 2 p — 2012 sin<pcos<p, 

0 r 12 = 0 12 = —0ii cos p sin p + 022 cos <p sin p + 0i2(l — 2 sin 2 p), 

0 r 13 = 013 = +013 COS p + 023 sinp, 

0 23 = 023 = — 013 sin <P + 023 COS p. 


The determinant of the system of the last two equations, 


cos <p — 1 
— sin<p 


sin<p 
cos <p — 1 


= 2(1 — cos<p), 


vanishes only for p = 0, 2n,.... If there is symmetry (n >2), then we must have 
013 = 023 = 0 , i.e., the z-axis must be a principal axis. 

Two of the remaining three equations are identical, and there remains the system 
of equations 

(022 — 0li) sin 2 cp + 2012 sin <p cos (p = 0, 

(022 — 0n)cos<psin<p — 2012 sin 2 <p = 0. 


The determinant of coefficients has the value 

4 9 9 9 

D = —2 sin p — 2 sin <p cos <p = —2 sin <p. 


There holds D = 0 for (p = 0, tv, 2tv, _Hence, 0n = 022 and 0 i2 = 0, if n > 2. If 

the axis of rotational symmetry z is at least 3-fold, the tensor of inertia is diagonal for 
each orthogonal pair of axes in the x, y-plane. 


EXERCISE 


12.6 Tensor of Inertia and Ellipsoid of Inertia of a System of Three Masses 


Problem. A rigid body consists of three mass points that are connected to the z-axis 
by rigid massless bars (see Fig. 12.15). 

(a) Find the elements of the tensor of inertia relative to the v, y, z-system. 

(b) Calculate the ellipsoid of inertia with respect to the origin 0, and the moment of 
inertia of the entire body with respect to the axis 0 a. 
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Fig. 12.15. Rigid body con¬ 
sisting of three mass points 


Solution, (a) The elements of the tensor of inertia relative to the x,y,z- system are 

&xx = Y^ m i(yf + <b 

i 

= mi(y\ + z\)+m2{yl + z 2 2 )+mj,(yl+zl), 

and after inserting the numerical values from Fig. 12.15, one has 

® xx = 100(144 + 25) + 200(64 + 225) + 150(144 + 196) (gem 2 ) 

= 125.7 (kgcm 2 ). 

Likewise, one obtains 

® yy = 117.5 0<gcm 2 ) and 0,, = 104.75 (kgcm 2 ). 

For the deviation moments of the tensor of inertia, it follows that 

®xy — ~ 'y ' m i (Xjyi) 

i 

= 100(12 • 10) - 200(10 • 8) + 150(11 • 14) (gem 2 ) = 19.1 (kgcm 2 ), 
and likewise, 

@ xz = —44.8 (kgcm 2 ) and & yz = 4.800 (kgcm 2 ). 

(b) From (a) one now immediately obtains for the ellipsoid of inertia with respect 
to the origin 0 (see (12.30)) 

© = © x;c cos 2 a + ©^ cos 2 + © zz cos 2 y 

+ 2 ® xy cos a cos P + 2@ xz cos a cos y + 2© JZ cos P cos y. (12.47) 
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12 Rotation About a Point 


Exercise 12.6 


To calculate the moment of inertia ©o« 2 , we evaluate the direction cosines with the 
coordinates given in Fig. 12.15, 


-6 


cos O' = 

cos P = 
and 

cos y = 


V6 2 + 8 2 + 20 2 
8 

V6 2 + 8 2 + 20 2 


20 


= -0.268, 
= 0.358, 

= 0.895. 


V6 2 + 8 2 + 20 2 

By inserting into (12.47) for the moment of inertia, we obtain 


&o a = (0.268) 2 • 125.7 + (0.358) 2 • 117.25 + (0.895) 2 • 104.75 
-2(0.268X0.358) • 19.1 + 2(0.268)(0.895) • 44.8 
— 2(0.358X0.895) • 4.800 (kgcm 2 ) 

= 128.87 (kgcm 2 ). 


EXERCISE 


12.7 Friction Forces and Acceleration of a Car 


Problem. A car of mass M is driven by a motor that performs the torque 2D on the 
wheel axis. The radius of the wheels is R , and their moment of inertia is © = mR 2 
(m is the reduced mass of the wheels). 

(a) Determine the friction force f which acts on each wheel and causes the accelera¬ 
tion of the car. The street is assumed to be planar. 

(b) Calculate the acceleration of the car if the torque 2D = 10 3 J, M = 2 • 10 3 kg, 
R = 0.5 m and m = 12.5 kg. 



Fig. 12.16. Wheel, accelera¬ 
tion a s , and friction force f 


Solution, (a) Fig. 12.16 shows one of the wheels and the force f acting on it. Since 
the linear acceleration of the wheel center is the same as that of the center of gravity 
of the car a s , one has 

Msl s = 2f —F. (12.48) 

The factor 2 accounts for the fact that a car in general is driven by two wheels. F 
is a possible external force which impedes the motion (air resistance), and a s is the 
acceleration of the car. For the torque relative to the axis, one obtains 


4©w = 2 (D-fR), 


(12.49) 
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where 0 is the moment of inertia of each of the four wheels, D is the accelerating 
torque, and — fR is the torque performed by the friction force on each wheel. The 
moment of inertia is © = mR 2 . If the car does not glide, one has 

cbR = a s , (12.50) 


and with (12.49) and (12.48), it immediately follows that 


d)R = a s 


1 / d/; ~/# 2 \ 

2 \ mR 2 ) 


2f-F 

M 


(12.51) 


Equation (12.51) finally yields for the friction force 


_ 1 (2D/R)M + 4mF 
^ 2 M + 4 m 

and by neglecting the backdriving force F (F = 0), we have 

f _ D/R 
1 1 + (4m/M )' 


(12.52) 


(12.53) 


(b) By replacing / in (12.49) by (12.53) and solving for a s (F = 0), one finds the 
acceleration of the car 


2 D/R _ 10 3 /0.5 _ 10 3 m 

M + 4m ~ 2- 10 3 + 4- 12.4 “ 1025 ~ 1 


With the numerical values from (b), the friction force / is given by 


/~ 


— = 1000 N. 
R 


Exercise 12.7 
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13.1 The Free Top 


A rigid, rotating body is called a top. A top is called symmetric if two of its principal 
moments of inertia are equal. If ©i = © 2 , we further distinguish 

(a) ©3 > ©i oblate top or flattened top, e.g., a disk; 

(b) ©3 < ©i prolate top or cigar top, e.g., an (extended) cylinder; and 

(c) ©3 = ©i spherical top, e.g., a cube. 

The third principal axis of inertia which is related to ©3 is called the figure axis. It 
specifies the spatial orientation of the top. For rotationally symmetric bodies it co¬ 
incides with their symmetry axis. Hence, the center of gravity of a rotational body 
always lies on the figure axis. Moreover, we must distinguish between the free top and 
the heavy top. For the free top one assumes that no external forces act on the body, 
so that the torque with respect to the fixed point vanishes. On the heavy top forces 
act, for example gravity. One can however imagine other forces (centrifugal forces, 
friction forces, etc.). For an experimental realization of a free top we only have to 
support an arbitrary body at the center of gravity. The body is then in an indifferent 
equilibrium, and there is no torque acting on it. 


(a) 0 3 > S l oblate top or 
flattened top, e.g., 
a disk 


(b) © 3 < ©j prolate top or 
cigar top, e.g., 
a (long) cylinder 


(c) © 3 = ©j spherical top, e.g., 
a cube 




Fig. 13.1. (i) Possible real 
form of the top. (ii) Ellipsoid 
of inertia 


W. Greiner, Classical Mechanics , 

DOI 10.1007/978-3-642-03434-3_13, © Springer-Verlag Berlin Heidelberg 2010 
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13 Theory of the Top 


Fig. 13.2. Model of a free 
top supported at the center of 
gravity S : The construction is 
so that S is also the supporting 
point 



For a theoretical description of the top, we start from the basic equations 

L = © • oo = constant (conservation of angular momentum), (13.1) 

T =-oo • L = constant (conservation of kinetic energy). (13.2) 

The angular momentum L and the kinetic energy T of the free top are constant in 
time. This is the content of the last two equations. 


13.2 Geometrical Theory of the Top 

We first will derive the laws governing the free top from geometrical considerations. 
The geometrical theory of the top is based on Poinsot’s 1 ellipsoid (also called the 
energy ellipsoid ): 

^xx^x 3 ” ®yy M y 3 ” ®zz^z 3 ” 2 ® X y(D x cOy + 2 ® xz co x co z + 20y Z cOyCO z 

= 2 T = constant. (13.3) 

This ellipsoid in the cu-space is immediately obtained from (13.2). It is similar to the 
ordinary ellipsoid of inertia and has the same body-fixed axes. 

In the subsequent considerations, we shall utilize the property of (13.3) that the 
endpoint of the vector oo lies just on the surface of the ellipsoid. 

Now follows Poinsot’s construction of the motion of the free top. The angular mo¬ 
mentum vector is constant and defines an orientation in space. The straight line de¬ 
termined by L is therefore called the invariable straight line. Moreover, the kinetic 
energy is constant, hence 2T = oo L = constant; from the definition of the scalar 
product immediately it follows that 

co • cos (oo, L) = constant. (13.4) 

In other words, the projection of oo onto L is constant. If one now considers oo as the 
position vector for points in space, the parameter representation oo(t) fixes a plane 
which is called an invariable plane. The invariable straight line is then perpendicular 
to the invariable plane. 

Now one can describe the motion of the top by the rolling of the Poinsot ellipsoid 
on the invariable plane. This is allowed since the endpoint of oo, as is evident from (13.4), 


1 Louis Poinsot , French mathematician and physicist, b. Jan. 3, 1777, Paris-d. Dec. 5, 1859, Paris. 
Professor in Paris, introduced in his Elements de statique (Paris, 1804) the concept of the couple to 
mechanics and used it to represent the motion of the top. Poinsot-motion means the motion of a free 
top. 
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lies on the surface of the ellipsoid and moves in the invariable plane. The invariable 
plane is also a tangent plane of Poinsot’s ellipsoid, since there is only one common 
vector co , and hence the ellipsoid and the plane have a common point. To prove this, 
we show that at the point co the gradient of the ellipsoid is parallel to L. From vector 
analysis we know that the gradient of a surface is perpendicular to this plane. The 
surface of the ellipsoid F is described by (13.3). 

Because 2 

dF dF dF 
3 co x ’ dcoy ’ 3 <j) z 

we obtain 

1 / ®xxU>x + ®xyMy + ®x Z 0> Z \ _ 

— W^F = I ® X y(L) X + ®yy(i)y + ®y Z Q) Z I = ©Cl) = L, 

\ ®x Z a) x + ®y Z My + ® ZZ CD Z ) 

i.e., gradcoF is parallel to L or F _L L; therefore, the tangent plane of F at the point co 
is parallel to the invariable plane. 

Since the center of the ellipsoid is a constant distance from the invariable plane 
(see (13.4)), the motion of the top can be described as follows: The body-fixed Poinsot 
ellipsoid rolls without gliding on the invariable plane, where the center of the ellipsoid 
is fixed. The instantaneous value of the angular velocity is then given by the distance 
from the center to the contact point of the ellipsoid. 

The ellipsoid rolls but does not glide. This follows from the fact that all points along 
the cu-axis are momentarily at rest; hence the contact point is also momentarily at rest. 
Rolling without gliding means that the changes of the rotation vector co measured from 
the laboratory system and from the body-fixed system are equal. Actually, 



dco 

dco 

+ CO X co, 

dco 

i.e., —— 

dco 

dt 

l dt 

K 

dt 

l dt 


Concerning the difference between gliding and rolling, if a wheel rolls on a plane, 
the velocities of change of the contact point P in the body-fixed and in the laboratory 
system are equal. If the wheel glides, the contact point in the body-fixed system is 
fixed; in the laboratory system its position changes permanently. 


2 Since the surface (13.3) is defined in the w-space, we mean by gradient the w-gradient, i.e., V w = 

{3/3 (o x , d/da)y, 3/3 co z }. 


Fig. 13.3. Invariable straight 
line and invariable plane 



Fig. 13.4. On the condition of 
rolling 
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13 Theory of the Top 


The trajectory of ( o on the invariable plane is called the herpolhodie or trace trajec¬ 
tory , the corresponding curve on the ellipsoid is called the polhodie or pole trajectory. 
See Fig. 13.5. 


Fig. 13.5. The Poinsot ellip¬ 
soid rolls on the invariable 
plane 



The polhodie and the herpolhodie are in general complicated, not closed curves. 
For the special case of a symmetric top, the Poinsot ellipsoid turns into a rotation 
ellipsoid, and by rolling of the rotation ellipsoid there arise circles, co has constant 
magnitude but permanently changes the direction, i.e., co rotates on a cone about the 
angular momentum axis. This cone is called the herpolhodie- or trace cone. For a 
symmetric cone it is efficient to use the symmetry axis (figure axis) as third axis for 
describing the motion. The figure axis that is tightly fixed to the ellipsoid rotates just 
like co rotates about L. The cone resulting this way is called the nutation cone. The 
motion of the figure axis of the top in space is called nutation. (The term precession 
used in the American literature makes little sense, since the term means a motion of 
the heavy top that is of a completely different origin.) 

An observer who is in the system of the top and considers the figure axis as fixed 
will find that co and L rotate about this axis. For the cone arising by the rotation of 
co the term polhodie- or pole cone is introduced. The precise orientation of the axes 
and cones depends essentially on the shape of the rotation ellipsoid. This is shown 
by the following two diagrams on the orientation of the axes. Note that a large prin¬ 
cipal momentum of inertia @3 corresponds to a small radius of the Poinsot ellipsoid, 
namely, ^/2T/Gi,. The other axes of the Poinsot ellipsoid accordingly have the lengths 
a/ 2T/©i and +J2T /© 2 , respectively. 

This is immediately seen from the form of (13.3) in terms of the principal axes: 


co; 


1/01 


+ 


coX 


+ 


cox 


1/02 1/03 


= 2T = constant. 


Figure 13.6(a) shows the ellipsoid of a flattened (oblate) top; Fig. 13.6(b) represents 
a prolate top. In the first case, the axes have the sequence co —L—figure axis; in the 
second case, the sequence is L— co —figure axis. 

Likewise is the sequence of the cones introduced above. Figure 13.6(c) shows the 
case of an oblate top, and Fig. 13.6(d) that of a prolate top. We note that the three axes 
lie in a plane. 
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Fig. 13.6. (a) Oblate symmet¬ 
ric top. (b) Prolate symmet¬ 
ric top. (c) Oblate symmetric 
top: The pole cone rolls inside 
of the trace cone, (d) Prolate 
symmetric top: The pole cone 
rolls outside of the trace cone 



13.3 Analytical Theory of the Free Top 

We consider the motion of the angular momentum and angular velocity vectors from 
a coordinate system that is tightly fixed to the top and moves with it. For the angular 
velocity, we have 

co = co\e\ -hCt>2^2 + &>3e3, 

where ei, e 2 , and e 3 are body-fixed principal axes of the top. We now investigate the 
angular momentum of the top no longer in the moving coordinate system, i.e., in the 
system of the top that rotates with co in the laboratory system, but transformed into 
the laboratory system, using our knowledge of moving coordinate systems. We then 
obtain 


L|lab — L| t0 p + CO X L. 

Because L| top = ©<*>, for the component in the laboratory system we have 

ei e 2 e 3 

CO] Cl>2 CO 3 

0 \ 60 \ ©2&>2 ©3CU3 


L|iab = ©l^lCi + ©2U>2^2 + 03^3^3 + 
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Solved for the components ei, e2, and e3 and combined, this reads 

L|lab = (©1<W1 + ©3&>2&>3 ©2^2<^3)ei 

+ (©2<i>2 + © \6l)\(jl>2 — ©3CL>iO;3)e2 
+ (©3<i>3 + ©2<^>1^2 — © 1 CO\ <^>2) e 3 • 

Since the laboratory system is an inertial frame of reference, we have the relation 

L = D. 


The torque is again expressed by the body-fixed coordinates, and we obtain 
L|lab = D\e 1 + £>2^2 + £>3^3- 
Thus, we find the Euler equations : 

£>i = G\6)\ + (©3 — ©2)&>2&>3, 

£>2 = ©2<i»2 + (©1 — © 3)^1 0)3 , ( 13 . 5 ) 

£>3 = ©3U>3 + (©2 Sl)c0\C02- 

These three coupled differential equations for co\(t ), a)2(t ), and 0)3 (t) are not linear. 
This suggests that in general the solutions &>; (0 are rather complicated functions of 
time. Only in the case of free motion (D = 0) can one obtain a transparent solution 
which will be discussed now. Later we shall deal with the heavy top for which D/0. 

We choose the body-fixed coordinate system so that the e3-axis corresponds to the 
figure axis. Since we will restrict the analytical consideration of the theory of the top 
to a free symmetric top that shall be symmetric about the figure axis, the following 
conditions hold: 


LIlab = D = 0, i.e., D\ = £) 2 = £^3 = 0, and ©1 = © 2 . 


We show that for a symmetric top e3, co , and L lie in a plane. For this we have to 
calculate the scalar triple product of the three vectors which must vanish: 


e 3 • (co x L) = e 3 • 


ei 

o) f 

©1 COl 


e 2 e 3 

(Ji>2 CO 3 

©2^2 ©3&>3 


= (©2 — © 1 )qJ\(l>2 = 0, 


because ©i = ©2. 

With the conditions for the free symmetric top, the Euler equations read 
©30)3 = 0 =>► (03 = constant, 

© 1 QJ\ + (©3 — ©1 )Ol> 2 m 3 = 0, 

© 1 6l>2 + (©1 — © 3)^1 6 l >3 = 0 . 

Thus, the component of co along the figure axis is constant. To show this in the subse¬ 
quent calculation, we set 


0)3 = u. 
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To solve the two differential equations, we differentiate the second equation with re¬ 
spect to time: 

0 1 C0\ + (©3 — 0] )U6l)2 = 0, 0] 002 T (©1 — © 3 )w£l>] = 0. 

By solving the last equation for 002 and inserting into the first one, we obtain 

(©3_©l)2 2 


(L>\ + 


©f 


-uco\ = 0 . 


This form of the differential equation is already known: setting 
| ©3 — © 1 1 


©1 


-u = k, 


we see that co\ + k 2 co\ = 0 is exactly the differential equation of the harmonic oscilla¬ 
tor, which is solved by 

co\ = B sin kt + C cos kt. 


Considering the initial condition co\ (t = 0) = 0, it follows that co\ = B sin kt, or from 
the second equation, C 02 = — B cos kt. 

The result means that co moves on a circle about the figure axis, as seen from the 
system of the top: 

co = B(smkt ei — cos kt^2) + *^ 3 . 

The rotational frequency is thereby given by k\ for k > 0 the rotation proceeds in 
the mathematically positive sense. The cone arising in the rotation is again called the 
pole cone. The angular momentum, which is given by L = © • co, also changes with 
time: 


^ 2 ' 

r 

\jo ± (0 

u 

co^O)^^ 

e i 

'■ — kt 


Fig. 13.7. Motion of co{t) in 
the ei, e 2 -plane 


L = (d\B sinkt e\ — G\B cos kt e 2 + © 3 ^ 3 , 

i.e., the L-axis rotates with the same frequency k but with a different amplitude about 
the figure axis (nutation). This is no contradiction to the statement |L|i a b = constant, 
since we measure the angular momentum from the system of the top. 

Finally, we determine the angles between the three axes. We set 

4 (e 3 , L)=a, 4 (e 3 > <°) = P, 

and scalar multiply e 3 and L; this yields 

e 3 • L = L cos a = V (©i #) 2 + (©3 u) 2 cosa 


or 


e 3 • L = e 3 • (©i^iei + ©20^2 + ©3^3^) = ©3^3 = © 3 ^- 

Equating both equations leads to 

©3 u 1 

COS OL = — - — — — 

V (©1 B ) 2 + ( 0 3 m ) 2 7 (© iS /© 3 m ) 2 +1 
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Fig. 13.8. 


or 


cos O'. 


© 1 B 
© 3 w 


+ 1 = 1 . 


Comparison of the coefficients with the trigonometric formula 
cosxV tan 2 x + 1 = 1 


yields tana = ©i#/© 3 w = constant. 

Performing the same calculation for e 3 • co, for we find 

B 

tan /j = = constant. 

u 

The comparison of the last two results shows the dependence of the orientation of 
the axes on ©i and © 3 : One has 

tana/tan= ©i/© 3 , 

from which it follows that 

(1) ©i > © 3 (prolate top) =+ a > /3 for a, /3 < 7 t/ 2; sequence of axes: e 3 — co — L; 

(2) ©i < © 3 (oblate top) =+ a < for a, /3 < 7 t/ 2; sequence of axes: e 3 — L — co; 

(3) ©! = © 3 (spherical top) =+ a = for a, < n ; co lies on the L-axis. Since the 
e 3 -axis of a spherical top can be chosen arbitrarily, there is no loss of generality if 
we set a = /3 = 0. 

For (3) we note that, for the spherical top, k = w(© 3 — ©l)/©l =0 because ©i = © 3 . 
For the spherical top, as was discussed above, we can set the figure axis (i.e., the 
e 3 -axis) arbitrarily, e.g., also along the L- or w-axis. The result a = would follow 
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also from 


~ / 01 

wxL = a)x@iw = 0 because ©=| 0 

V o 


0 0 \ 
01 0 
0 0!/ 


EXAMPLE 


13.1 Nutation of the Earth 

The earth is not a spherical top but a flattened rotation ellipsoid. The half-axes are 
a = b = 6378 km (equator) and c = 6357 km. 

If the angular momentum axis and the figure axis do not coincide, the figure axis 
performs nutations about the angular momentum axis. The angular velocity of the 
nutations is 

0 3 -0l 

k = -&> 3 . 

01 

The third axis is the principal axis of inertia (pole axis). If we consider the earth as 
a homogeneous ellipsoid of mass M, we obtain the two moments of inertia: 


0! =02 = — (b 2 + C 2 ), 


From this, we obtain 


A/L 9 9 

@3 = y (a 2 + b 2 ). 


k = 


2 2 
a L — c A 


7 0 ) 2 . 


b 2 + c 2 

Since the half-axes differ only a little, we set a = b ~ c, and thus, 


7 (a — c)(a + c) 

t= i,2 + e 2 


-&> 3 . 


The rotation velocity of the earth is &> 3 = 27r/day. Thus, we obtain for the period of 
nutation 

2n 

T = — = 304 days. 
k 

The figure axis of the earth (geometrical north pole) and the rotation axis oo of 
the earth (kinematical north pole) rotate about each other. The measured period (the 
so-called Chandler period ) 3 is 433 days. The deviation is essentially caused by the 
fact that the earth is not rigid. The amplitude of this nutation is about ±0.2". The 


3 Seth Carlo Chandler , b. Sept. 17, 1846, Boston, Mass.-d. Dec. 31, 1913, Wellesley Hills, Mass. 
American astronomer, detected the Chandler period of 14 months in the pole height fluctuations. He 
observed variable stars, and for a long time he edited the Astronomical Journal. 
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Example 13.1 



Fig. 13.9. Regular tetrahedron: 
g and h are straight lines (axes) 
that are perpendicular to the 
planes opposite C and D 


Fig. 13.10. Homogeneous 
three-axial ellipsoid: (a) side 
view, and (b) top view 


kinematical north pole moves along a spiral trajectory within a circle of 10 m radius 
in the sense of the earth’s rotation. 


EXAMPLE 

13.2 Ellipsoid of Inertia of a Regular Polyhedron 

The ellipsoid of inertia of any regular polyhedron is a sphere, which will be shown 
by the example of the tetrahedron; the reasoning for the octahedron, dodecahedron, 
and icosahedron is analogous. Suppose there were a principal axis of inertia with a 
moment which differs from those of the two other principal axes of inertia. In a rota¬ 
tion by 120° about the axis g (perpendicular from point C on the opposite plane; see 
Fig. 13.9), this axis of inertia must turn into itself, since the tetrahedron is transferred 
into itself. It is easily seen that only the axis g has this property and therefore must 
be the distinguished axis of inertia. But since h is a symmetry axis too, a rotation by 
120° about h must also transfer the axis of inertia g into itself, which however is not 
true. Assuming the existence of a distinct axis of inertia leads to a contradiction, and 
hence the ellipsoid of inertia of a tetrahedron must be a sphere. 


EXERCISE 

13.3 Rotating Ellipsoid 

Problem. A homogeneous three-axial ellipsoid with the moments of inertia ©i, 
© 2 , ©3 rotates with the angular velocity <p about the principal axis of inertia 3. The 
axis 3 rotates with ft about the axis AB. The axis AB passes through the center of 
gravity and is perpendicular to 3. Find the kinetic energy. 



Solution. We decompose the angular velocity co into its components along the prin¬ 
cipal axes of inertia: 


Example 13.1 



Fig. 13.9. Regular tetrahedron: 
g and h are straight lines (axes) 
that are perpendicular to the 
planes opposite C and D 


Fig. 13.10. Homogeneous 
three-axial ellipsoid: (a) side 
view, and (b) top view 


co = (oo\, cl> 2 , ( 03 ), where co\ = ft cos ^ 9 , co 2 = —ft sin (p , or > 2 = cp. 


The kinetic energy is then 
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The ellipsoid shall now be symmetric, ©i = © 2 ; the axis A B is tilted from the Exercise 13.3 

third axis by the angle a . For the total angular velocity, we have 

co = (pe 3 + $e A B- 



e 3 



Fig. 13.11. The axis AB tilted 
from the third axis by a : Posi¬ 
tions of the axes are shown in 
perspective 


We decompose the unit vector e A B along the axis AB with respect to the principal 
axes 

e AB = e 3 • cos a + (cos^ei — sin<pe 2 ) sin a. 

Thus, the components of oo along the directions of the principal axes are 

oo\ = sin a cos (pft, 

0)2 — — sin a sin (pft, 

0)2 — cp + cos aft. 

Hence, the kinetic energy reads 

T = -©i sin 2 ^ 2 + -© 3 (cp + ft cosa) 2 . 

For a = 90°, we obtain for the first case a rotation ellipsoid. 


EXERCISE 

13.4 Torque of a Rotating Plate 

Problem. Find the torque that is needed to rotate a rectangular plate (edges a and b) 
with constant angular velocity oo about a diagonal. 

Fig. 13.12. The rectangular 
plate rotates about the diago¬ 
nal axis 


Solution. The principal moments of inertia of the rectangle are already known from 
Example 11.7: 

/i = T Ma 2 , I 2 = j^Mb 2 , It, = -Em (a 2 + b 2 ). 



(13.6) 
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Exercise 13.4 


The angular velocity is 

(O — (CU • “1“ 

i.e., 

cob coa 

(o =- e>- H- e v 

\/ a 2 + b 2 \/ a 2 + b 2 

—cob -\-coa 

=* U>\ = —F==, ^2 = ^==, ^3 = 

wa 2 + b 2 \/a 2 + b 2 

Inserting (13.6) and (13.7) into the Euler equations yields 

I\CO\ + (/3 — l2)C02m = £>i, 

hroi + (7i — h)u>?>(D\ = £>2, 

I3CO3 + (I2 — I\)C02C0\ = D3, 


and furthermore, D\ = 0, £>2 = 0, and 

_ -Milo 2 - a 2 )abco 2 

° 3 ~ 12(a 2 + 6 2 ) 

Hence, the torque is 

_ —M(b 2 — a 2 )aba> 2 
12 (a 2 + b 2 ) z 

For a = b (square), D = 0! 


(13.7) 


EXERCISE 


13.5 Rotation of a Vibrating Neutron Star 


Problem. The surface of a neutron star (sphere) vibrates slowly, so that the principal 
moments of inertia are harmonic functions of time: 


I zz = -mr(l + e coscot), 


Ixx — lyy — rnr ( 1 £ 


( cos cot \ 


£ <<C 1. 


The star simultaneously rotates with the angular velocity Q (t). 

(a) Show that the z-component of Q remains nearly constant! 

(b) Show that Q ( t ) nutates about the z-axis and determine the nutation frequency for 

Q z co. 















13.3 Analytical Theory of the Free Top 


221 


Solution, (a) If the total angular momentum is given in an inertial system, then 

^ = 0 . 


dt ) 


inertial 


The principal moments of inertia are, however, given in a body-fixed system that ro¬ 
tates itself with the angular velocity £2 with respect to the inertial system. Then 


dL\ 


dt J 


inertial 


dL\ 


— I = I — 1 + £2 x L = 0. 


dt J k 


One therefore gets in the body-fixed system (Euler equations) 


(I ZZ Q Z ) — 0, 

dt 

d 3 

(Ixx ^2jc) H" ~ COS ^ = ^5 

d 3 

~ (lyy ^2-y ) ~ Iq ^2jc £2^ ^ COS COt == 0, 


(13.8) 

(13.9) 
(13.10) 


where /o = (2/5 )mr 2 is the moment of inertia of the sphere. (13.8) has the solution 
£2oz 


Q 7 


1 + £ COS cot ’ 


where £2o z follows from the initial conditions; this means that Q z is only very weakly 
time dependent. 

(b) We suppose that co £2 Z , i.e., 


dl x 


* 0 and 


dl 


yy 


dt dt 

From this, we find 


< 0 . 


3 . 3 

I xx -1— ~ I 0 ^2^ s cos cot ^2-y — 0, lyy ^2^y ~ Iq ^2^ s cos cot £2^ == 0. 

Differentiating again and inserting (13.8), (13.9), and (13.10) yield 


(13.11) 


Ixx^x T" 


lyyQy 


1 / 3 


-/o^2 z ecoso;M Q x = 0, 


yy 

1 / 3 


-/o£ 2 z £ coscot ) Qy = 0. 


(13.12) 


If Ixx — lyy ^ ^0? then 


Q x + ( -s£l z cos cot ) Q x = 0, 


Q y + I -st2 Z cos cot ) Q y = 0. 


Since co^Q z (we further assume that co <^i e£2 z ), we find 
3 

co n = -£Q z cos cot (nutation frequency), 
i.e., Q x and Q y perform a nutation motion with co n . 


Exercise 13.5 
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Fig. 13 . 13 . Geometry and piv¬ 
oting of the rotating circular 
disk 


EXERCISE 


13.6 Pivot Forces of a Rotating Circular Disk 


Problem. A homogeneous circular disk (mass M, radius R) rotates with constant 
angular velocity co about a body-fixed axis passing through the center. The axis is 
inclined by the angle a from the surface normal and is pivoted at both sides of the disk 
center with spacing d. Determine the forces acting on the pivots. 


Solution. The Euler equations read 


I\co\ — ( 02 ( 03(12 — h) = D\, 

(13.13) 

I 2(^2 — co\ 003(13 — I\) — D 2 , 

(13.14) 

13003 — oo\ 002(1 \ — 1 2 ) = D 3 , 

(13.15) 


where D = {Di, D 2 , D 3 } represents the torque in the body-fixed system. We choose 
the body-fixed coordinate system in such a way that n = e 3 and ei lies in the plane 
spanned by n, 00 . For the principal momentum of inertia / 1 , we have 


In R 


In R 


2n 


I\ — g j j y 2 r dr dip = a J J 1 


y z rdrdcp = G j j r 3 sin 2 ip drdcp = - 


r fi4 / 


r\ 

sin ip dip 


0 0 


1 


= -gR*7T = ~ 


1 / M 


4 \jcR 2 


0 0 


R 4 jt = -MR 2 , 
4 


(13.16) 


since the surface density a is given by a = M/F = M/rtR 2 . And likewise for I 2 
and 73 


1 1 7 

Il=I 2 = -h = -MR 2 . 

2 4 

The components of the angular velocity vector are given by 


(13.17) 


co = {co\ = co sin a, 002 = 0, C 03 — a; cos a}. (13.18) 

Because 00 = 0, inserting (13.17) and (13.18) into (13.13) to (13.15) yields 

1 


D 1 = D 3 = 0 and D 2 = —co 2 sina cosa-MR 2 . 


(13.19) 

Because D = r x F, in the pivots act equal but oppositely directed forces of magnitude 


ID 2 I 9 9 1 /I \ 9 9 sin 2 a 

|F| = — =MR 2 u > 2 — [ - sin 2a =MR 2 co 2 - 

2d 4d\4 ) 16 d 

(see Fig. 13.13). 


(13.20) 
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EXERCISE 


13.7 Torque on an Elliptic Disk 


Problem. What torque is needed to rotate an elliptic disk with the half-axes a and 
b about the rotation axis OA with constant angular velocity cool The rotation axis is 
tilted from the large half-axis a by the angle a . 



Solution. We choose the ei-axis orthogonal to the plane of the drawing, e 2 along the 
small half-axis b , and e 3 along the large half-axis. The principal moments of inertia 
are then (M = anab, dM = adF) 


+a <p(z) 


h = cf 


J J z 2 dzdy with <p(z) = bJl — ^ 


-a -<p ( z ) 

from the ellipse equation z 2 /a 2 + y 2 jb 1 = 1. 


h 


-\-a 

=°f z 


z 2 y 


b^J 1—z 2 /a 2 
—b^/l—z 2 /a 2 


dz 


-\~ci 

= 2 ob J z 


z 2 x \ 1 - ^ dz 
a z 


= 2a-\^(2z 2 -a 2 )y]a 1 - z 2 + ^-arcsin-^-1 
a [ 8 8 \a\ J 


+a 


1 3 1 9 

= -aba Tt = -Ma 
4 4 


accordingly, 


(13.21) 


+b <p(y) 


h 


= cr j J y 2 dydz with <p(y) = aj 1 - 


2 

*2 


- b-(p(y ) 


1 


/ 3 = -Mb . 


(13.22) 


We can immediately write down /i (because I\ = h + / 3 for thin plates): 
1 


/ 1 = -M(tf 2 + 6 2 ). 


(13.23) 


For w, we obtain 


Fig. 13.14. 


(o = 0 • ei — coo sin a • e 2 + coo cos a • e 3 . 
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We insert into the Euler equations of the top: 


D\ = I\co\ + (I3 h) 0 ) 2 U >3 

= — ~M(b 2 — a 2 ) sin a cosa • co^, 
13 2 = I2C02 + (11 - 13)0)1(03 = 0 , 

£>3 = I3CO3 + (I2 - h)(0\(02 = 0 . 


Thus, we obtain for the desired torque D 

2 ^7 2 2 

D = —co 0 •—(b — a ) sin 2a • ei. 

8 

It is obvious that 


(13.24) 


(13.25) 


(1) for a = 0, 7t/ 2, 7T, ..., the torque vanishes, since the rotation is performed about 
a principal axis of inertia; and 

(2) for b 2 = a 2 , i.e., the case of a circular disk, the torque vanishes for all angles a. 

We will consider once again the last conclusions: Given an elliptic disk with half-axes 
a and b, for a = 0°, 180°, or for a = 90°, 270°, the rotation axis coincides with one of 
the principal axes of inertia along the half-axes. In this case the orientation of the angu¬ 
lar momentum is identical with the momentary rotation axis. Because coo = constant, 
we also have L = constant, and therefore the resulting torque vanishes. This also re¬ 
sults by insertion into the Euler equations of the top: co = (0, 0, 0), co = (0, 0, coo) or 
co = ( 0 , coo, 0 ) 


=> D\ = I\CO\ + (I 3 — l 2 )C 02 C 03 = 0, 

D 2 = 72^2 H - (h — I?>)ct2\co2 = 0, (13.26) 

O3 = I3C03 + (h — h)co\co2 = 0 . 


13.4 The Heavy Symmetric Top: Elementary Considerations 



Fig. 13.15. A heavy top: The 
center of gravity S and the bear¬ 
ing point O are sketched in 


We now consider the motion of the top under the action of gravity. If the bearing point 
O of the top does not coincide with the center of gravity S’, gravity performs a torque. 
To distinguish the top from the freely moving top, it is called the heavy top. First we 
restrict ourselves to the symmetric top which rotates with the angular velocity co about 
its figure axis. The origin of the space-fixed coordinate system is set into the bearing 
point O , the negative z-axis points along the gravity force. Let the distance OS = 1; 
gravity acts on the top of mass m with the torque D = 1 x mg. Hence, the angular 
momentum vector is not constant in time: 

L = D or JL = D -dt. 

This differential form of the equation of motion expresses that the torque causes a 
change JL of the angular momentum which is parallel to the torque D. 
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Fig. 13.16. Position of the 
heavy top in the coordinate 
system 


Sommerfeld 4 5 and KleiiF in their book Theory of the Top called this phenomenon— 
philosophizing— “Die Tendenz zum gleichsinnigen Parallelismus.” The z-component 
of the angular momentum is however conserved. This results from the following con¬ 
sideration: Because g = — ge z , we have D = mge z x 1, i.e., the torque has no com¬ 
ponent along the z-direction, and hence L z is constant. Hence the torque D causes a 
motion of the angular momentum vector L on a cone about the z-axis; this motion 
of the heavy top is called precession. The precession frequency of the top is thereby 
constant by reason of symmetry; the relative orientation of the torque and angular 
momentum vectors is also constant. 

We now calculate the precession frequency. For this purpose we start from the 
radial component L r of the angular momentum: 

L r = L sin d. 

The angle covered by L r in the time dt is 

d dL r dL Ddt 
L r L r L sin#’ 

For the precession frequency co p = da/dt, we thus have 

D mgl 

p Lsintf L 



Fig. 13.17. On the calculation 
of the precession frequency co p 


or in vector notation 


(o p x L = D. 


4 Arnold Sommerfeld, b. Dec. 5, 1868, Konigsberg-d. April 26, 1951, Munich, physicist. From 1897, 
professor in Clausthal-Zellerfeld and from 1900 in Aachen, successfully tried for a mathematical 
backup of technique. In 1906, Sommerfeld became professor for theoretical physics in Munich, 
where he was an excellent academic teacher for generations of physicists (among others P. Debye, 
P.P. Ewald, W. Heisenberg, W. Pauli, and H.A. Bethe). He extended Bohr’s ideas in 1915 to the 
“Bohr-Sommerfeld theory of atom” and discovered many of the laws for the number, wavelength, 
and intensity of spectral lines. His work Atomic Structure and Spectral Lines (Vol. 1, 1919; Vol. 2, 
1929) was accepted for decades as a standard work of atomic physics. Further works: Lectures on 
Theoretical Physics, six volumes (1942-1962). 

5 Felix Klein, b. April 25, 1849, Dusseldorf-d. June 22, 1925, Gottingen. Klein studied from 1865 to 
1870 in Bonn. During a stay in 1870 in Paris, he became familiar with the rapidly developing group 
theory. From 1871, Klein was a private lecturer in Gottingen, from 1872, professor in Erlangen, from 
1875, in Munich, from 1880, in Leipzig and from 1886, in Gottingen. He contributed fundamental 
papers on function theory, geometry, and algebra. In particular, group theory and its applications 
attracted his interest. In 1872, he published the Program of Erlangen. In later life Klein became 
interested in pedagogical and historical problems. 
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Fig. 13.18. In general, a nu¬ 
tation is superimposed on the 
precession 


Hence, the precession frequency is independent of the inclination ft of the top if 
ft ^ 0 is presupposed. In the general case nutation motions are superimposed on the 
precession, so that the tip of the figure axis F no longer moves on a circle but on a 
much more complicated trajectory about the z-axis. The angle ft varies between two 
extreme values 

fto — Aft <ft<fto + Aft 

(see Fig. 13.18). If D = 0, there is only the nutation of the figure axis F about the then- 
invariable straight line L. For D / 0, the precession of L about the z-axis dominates. 
The nutation is superimposed on this precession. 

Since in the special case considered here the vectors of angular momentum and 
angular velocity coincide with the figure axis of the top, we can write for the angular 
momentum 


L = © 3 <U, 

where ©3 is the moment of inertia about the figure axis. For the torque we then have 
the relation 


D = © 3 ^ X (O . 

The inverse of this moment D r = — D = © 30 ) x co p is called the top moment. It is 
that torque the top performs on its bearing if it is turned with the angular velocity co p . 
This top moment—sometimes also called the deviation resistance —can reach very 
large values for a quickly rotating top and a sudden turn of its rotation axis. We feel it 
for example if we suddenly turn the axis of a quickly rotating flywheel held in the hand 
(e.g., wheel of a bicycle). The top moment observed from a reference system rotating 
with the angular velocity 00 p is identical with the moment of the Coriolis forces, which 
can be proved. 

The earth also precesses under the action of the gravitational attraction of the sun 
and moon. The earth is a top with a fully free rotation axis, but it is not free of forces. 
As a result of its flattening and of the tilted ecliptic, the attraction of the sun and moon 
generates a torque. We imagine the earth as an ideal sphere with a bulge upon it, which 
is largest at the equator, and we first consider only the action of the sun. In the center 
of earth (center of gravity), the attraction exactly balances the centrifugal forces due 
to the orbit of the earth about the sun. We divide the bulge into the halves pointing to 
the sun and away from it, respectively. The attraction of the sun on the former half is 
larger than at the earth’s center, because of the smaller distance. The centrifugal force, 
however, is by the same reason smaller. At the center of gravity Si of the half-bulge 
results a force K pointing toward the sun. 

On the side away from the sun, the situation is reversed. Here the centrifugal force 
dominates over the attraction by the sun, and at the center of gravity S 2 of the half¬ 
bulge there is a force — K pointing away from the sun, which—because of the inclina¬ 
tion of the ecliptic—forms a couple with the former force. The couple tries to turn the 
earth axis and the axis which is perpendicular to the earth orbit radius and lies in the 
orbital plane. From this follows the precession motion about the axis perpendicular to 
the earth orbit. The moon acts in the same sense, but even more strongly than the sun, 
because of its small distance. The earth axis rotates in 25800 years (“Platonic year”) 
once on a conical surface with the vertex angle of twice the inclination of the ecliptic, 
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i.e., 47°; it therefore changes its orientation over the millennia. This precession motion 
must be distinguished from the nutations of earth (Chandler’s nutations) discussed in 
Example 13.1. The latter are superimposed on the precession motion. 

Possibly the most important practical application of the top is the gyrocompass. The 
idea goes back to Foucault (1852). The gyrocompass consists in principle of a quickly 
rotating, semi-cardanic suspended top, with the rotation axis kept in the horizontal 
plane by the suspension. 

The earth is not an inertial system; it rotates with the angular velocity (Oe . Since the 
top wants to preserve the orientation of its angular momentum, it is forced to precess 
with oo e • Hence, there is a top moment D r : 

D' = ©3 (Or x (Oe , 
where we set 

c oe = coe sin cpe z + coe cos = (*>e z + 00 e n 

with (p as the geographic latitude, e at is a unit vector pointing along the meridian. By 
splitting ooe one obtains 

D r = ©3 {(Or x (Oe z + (Or x (Oe n ). 

The first term is compensated by the bearing of the top. This part of the top moment 
tends to turn the AB -axis (see Fig. 13.20(b)) The second term causes a rotation of the 
top about the z-axis. The splitting of (Ok leads to the acting torque 




6 See W. Greiner: Classical Mechanics: Point Particles and Relativity, 1st ed., Springer, Berlin 
(2004), Chapter 28. 


Fig. 13.19. Mass ring caused 
by the orbiting sun as seen 
from the earth 6 


Fig. 13.20. (a) Decomposition 
of the angular frequency (Oe 
into a vertical and a hori¬ 
zontal component, (b) Semi- 
cardanic suspension: This top 
can freely rotate about the 
A B -axis 
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Fig. 13.21. Decomposition of 
the angular velocity of the 
earth (<*>£) and of the top 


Fig. 13.22. Principle of the 
gyroscope 



Y>' = sin oude cos cpe z . 

Hence, a torque arises which always tends to turn the top along the meridian (a = 0). 

If the suspension of the top is damped, the top adjusts along the north-south direc¬ 
tion provided that it is not just at one of the two poles (cp zb 90°). Otherwise, it performs 
damped pendulum vibrations about the north-south direction. One can therefore use 
the top as a direction indicator if one is not close to a pole. 

Foucault’s experiments with a “gyroscope” led only to indications of the described 
effect. Anschiitz-Kaempfe succeeded in constructing the first useful gyrocompass 
(1908). To reduce the friction, the gyroscope body—a three-phase current motor— 
hangs at a float that floats in a basin of mercury. The top axis is kept horizontal by 
placing the center of gravity of the top lower than the buoyancy center (corresponding 
to the suspension point). In this setup the gyroscope axis vibrates under the influ¬ 
ence of the moment not only in the horizontal, but also in the vertical plane about the 
north-south direction. 



By an appropriate damping of the latter of these coupled vibrations, one can also 
reach a damping of the vibrations in the horizontal plane, which is needed for the 
adjustment. The deviations arising from ship vibrations and from other effects could 
be removed in more recent construction (multiple gyrocompass), or accounted for by 
calculation. 


13.5 Further Applications of the Top 

In order to stabilize free motions of bodies, e.g., of a disk or a projectile, these are 
set into rapid eigenrotation (spin). A disk thereby maintains its tilted position almost 
unchanged, and therefore gets a buoyancy similarly to the wing of a plane and thus 
reaches a much larger range of flight than without rotation. A prolate projectile rotat¬ 
ing about its longitudinal axis experiences a torque from the air resistance which tends 
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to turn the projectile about a center-of-gravity axis perpendicular to the flight direc¬ 
tion. The projectile responds with a kind of precession motion which is very intricate 
because of the variable air resistance. The vibrations of the projectile tip remain close 
to the tangent of the trajectory, but for a projectile with “right-hand spin” drift off to 
the right of the shot plane. The projectile therefore hits the target with the tip ahead, 
but on firing one has to account for a right-hand deviation. 

The gyroscope torques acting on guided tops tend to turn up the axes of the wheels 
of a car passing a bend, which causes an additional pressure on the outer wheels and a 
relief of the inner ones. The same gyroscope effect provides an increase of the milling 
pressure in grinding mills and finds further application in the turn and bank indicator 
of airplanes. If the plane performs a turn, the gyroscope actions on the propeller must 
be taken into account. 

The top can also serve to stabilize systems (cars) which by their nature are unstable, 
as in the one-rail track, or to reduce the vibrations of an internally stable system, 
as in Schlick’s ship gyroscope. In the latter device a heavy top with a vertical axis, 
driven by a motor, is set into a frame that can rotate about the transverse horizontal 
axis. During ship vibrations about the longitudinal axis—these “rolling vibrations” 
shall be damped—the top performs vibrations because of the precession about the axis 
lying across the ship; ship vibrations and top vibrations represent coupled vibrations. 
The top vibrations are appropriately damped by a brake. By the coupling the released 
energy is pulled out of the ship vibrations; as a result these are considerably reduced. 
As is seen from the above example, for stabilization by a top it is generally essential 
that its rotation axis is not fixed relative to the body, but that all degrees of freedom 
are available. For this reason a bicycle with a tightly mounted front wheel would not 
be stable. Moreover, riding a bicycle without support is also partly based on the laws 
of top motion. 

An indirect stabilization is used by the devices which control the straight motion of 
torpedoes. On deviation from the shot direction, the top activates a relay which causes 
the adjustment of the corresponding rudder. 

An important problem is to stabilize a horizontal plane so that it remains horizontal 
on a moving ship or airplane. This so-called artificial horizon (gyroscope horizon, 
flight horizon) could, according to Schuler, be realized by a gravitation pendulum 
with a period of 84 minutes (pendulum length = earth radius), since such a pendulum, 
even when the suspension point is moved, points to the earth’s center. Useful artificial 
horizons could be realized by tops with cardanic suspension (“top pendulum,” center 
of gravity below the rotation point). 

We finally note that an ordinary play top that moves with a rounded tip on a hori¬ 
zontal plane, and thus has five degrees of freedom, does not fit the definition of a top, 
since in general no point remains fixed during its motion; i.e., translation and rotation 
motion cannot be dynamically separated. The fact that a play top with an initially tilted 
axis straightens up under sufficiently fast rotation—which also happens, for example, 
with a cooked egg—can be explained by a torque created by the friction. 

EXERCISE 


13.8 Gyrocompass 

Problem. A simple gyrocompass consists of a gyroscope that rotates about its axis 
with the angular velocity co. Let the moment of inertia about this axis be C and the 
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Fig. 13.23. Q 

velocity of the 
the gyroscope 


moment of inertia about a perpendicular axis be A. The suspension of the gyroscope 
floats on mercury, hence the only acting torque forces the gyroscope axis to stay in the 
horizontal plane. The gyroscope is brought to the equator. Let the angular velocity of 
the earth be Q. What is the response of the gyroscope? 

Solution. Since the earth rotates with angular velocity Q, the angular momentum in 
the earth system satisfies 

dL 

— = D- ^xL, 
dt 


x 

is the angular 
earth, co that of 


where D is the total torque. At the equator Q points along the y-axis, and the z-axis is 
perpendicular to it. 

The components of the angular momentum are 
L x — Ccosincp, 

CO 

L y = Ccocoscp, — 1, 

L z = Acp. 


We suppose that (p is small. Then 
L x = Ccocp, 

CO 

L y = Cco, — < 1, 

L z = —Acp. 

Since there are no forces acting in the x,y-plane, D z = 0. Hence, the equation for L z 
is 

Acp = —CcoQcp 


or 

C C 

cp-\ - coQcp = 0; i.e., cp + co^cp — 0, co^ = —coQ. 

A A 

cp oscillates with the frequency 
' C \ 1/2 


—coQJ 
in the north-south direction! 


EXERCISE 


13.9 Tidal Forces, and Lunar and Solar Eclipses: The Saros Cycle 7 

Problem. Theancient Chinese court astronomers were able to predict lunar and solar 
eclipses with great reliability. The fact that such eclipses arise only occasionally— 


7 The name goes back to the Chaldeans , a Babylonian tribe. Thales presumably used Babylonian ta¬ 
bles for predicting the solar eclipse in 585 B.C. The knowledge of natural science of the Babylonians 
was highly developed. They had tables for square roots and powers, approximated the number jt by 
31/8, and could solve quadratic equations. The subdivision of the celestial circle into 12 zodiacal 
symbols and the 360° division of the circle are modem examples of Babylonian nomenclature. 
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Fig. 13.24. 


while otherwise we have a full moon or a new moon—is caused by the inclination of 
the orbital plane of the earth-moon system from the ecliptic, i.e., the orbital plane of 
the motion of the common center of gravity about the sun. This inclination is about 
5.15°. It is not fixed in space but precesses because of the tidal forces exerted by 
the sun. This leads to the so-called Saros cycle, which is of great importance for the 
prediction of eclipses. 

Consider the earth-moon system as a dumbbell-shaped top which rotates about 
its center of gravity S p \ the center of gravity orbits about the sun on a circular path. 
The gravitation force between the earth and moon just balances the centrifugal force 
resulting from the eigenrotation of the system and thus fixes the almost rigid dumbbell 
length ro. The gravitation of the sun and the centrifugal force due to orbiting about the 
sun don’t compensate for each body independently but lead to resulting tidal forces. 
These forces create a torque Mo on the top. Calculate Mo for the sketched position 
where it just takes its maximum value. Realize that Mo on the (monthly and annual) 
average has a quarter of this value. Calculate from this the precession period T p . Can 
you find arguments for why the actual Saros cycle of 18.3 years is notably longer? 

Hint: The only data you need for the calculation are the distances ro, the length of 
year, and the length of the sidereal month. 

Solution. Ro is defined as the vector pointing from the center of gravity of the sun to 
the center of gravity of the earth-moon system. The coordinate origin of the system is 
in the center of gravity of the sun. 

Let R be given in cylindrical coordinates: 

R = R 0 + AR 

— Ro^r T - AR^e^ T- T- AR^e^ 


with 

|AR| « |R 0 |. 

We then have 


IRI 


tfo 1 + 


2A R r 


Ro 


1/2 



Rn 



Fig. 13.25. The adopted unit 
vectors 


sun 


e 


earth 


cm 
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Hence, we can write for the gravitational force 


ymM 

F Gr (R) = -4r^R 


|R |3 

ymM 

Rl 


0-t) 


ymM 


R, 


(Ro ~ 2A R r ) I + e, 


(/?0^r T - AZ^e^ + A7?<^6^ T A R^e^ 
ymM 


+ e z - 


(-^4 


( ymM \ 

■^r a 4 


(13.27) 


The two masses m and M don’t yet have a special meaning. We now consider the 
motion of the earth-moon system as a two-body problem with an external force: 

777 l R l + 777 m • R M 

Rcm := Ro =- (CM means center of mass), 


VcM = 
Bcm = 


mE + mm 
wieVe + rriM^M 
mE+rriM 
m e B e + rriM^M 
m E +m M 

¥ E s + ^em + F M e + Jms 


mE + mm 

According to (13.27), 


0 S means sun). 


AR ^ = ye, 

ARm = I'M, 

Me*E = -Mm^M\ 


further from (13.27) we have 


Bcm 


1 


me +m m 
yM cm 


-y- 


(m E +m m )M s 


RA 


• e r — 


^CM 


Ro • e r (circular acceleration). 


(13.28) 


The last equality follows from the equilibrium condition for the center of gravity. The 
magnitude of the gravitational acceleration must be equal to the magnitude of the 
circular acceleration. From this, it follows that the center of mass CM rotates with the 
frequency cocm about the sun at the distance Ro. 

We further know the following values: 


7cm = ——- = 365 days; Ro = 149.6 • 10 6 km. 

<^CM 


(13.29) 

















13.5 Further Applications of the Top 


233 


We are mainly interested in the motion of the earth-moon system. To this end we 
consider the relative distance r re i between the earth and moon. 


r rel 

Rrel 

^Prel 

dt 

F EM 


— R E Rm> l r rell —^0? 
= ^(Ve-Vm) with /x = 

= I x(B e -Vm)=J— + 


me • mm 
m e +m m ’ 

^ EM F me 


m e 


m e 


m m 


m e m m 

-Y o r rel — 


-mco M r re i. 


Fm^ \ 
m m ) 


The last equality holds because of the equilibrium condition, as for the center-of-mass 
acceleration. 

The relative distance also performs a circle with the sidereal period of the moon at 
the distance tq. One has 


Tm =-= 27 days + 8 hours, 

cdm 

r 0 = ve + em = 0.384 • 10 6 km. 


(13.30) 


We thus obtain the following combined motion: 
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R e = R() + 


R m = Ro — 


m m 

mm +m e 
m e 

mm +m e 


r rel 

r rel 


epicycle motion. 


(13.31) 


The angular momentum with respect to the center of mass CM is given by 


Lo = r E x tfiEiy e — Vcm) + I’m x mm(Vm Vcm) 

m m ( m m \ 

= m E ---r re i x --- v re i 

m e +m m \m e +mmJ 

m e ( m e \ 

H - Mm Trel X I ) V re l 

me+mm \me + mm ) 

M E M M 

=-;-r re i X v re i 

Me +mm 

= 11 • r re i X v re i 

MeMm 2 , 

~-;- ojm • r {) • 1 em, 

me-Emm 


(13.32) 


where 1 em represents a normal vector to the orbital plane of the earth-moon system. 
The relation (13.32) does not hold exactly, since the motion of the earth-moon system 
is not perfectly circular, but can be well approximated by a circle. 
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The coordinate system at the center of gravity is oriented just as at the origin. The 
total angular momentum with respect to the sun is evaluated as 

L to t = m e Re xV £ + rriM • Rm x Vm 

fjy . m M \ (-J m M \ 

v m M +rn E J \ m M +m E ) 

( m E \ ( rn E \ 

Ro-;-r re i x V C m ---v re i 

m M + m E ) \ m M +m E ) 

= (m E + m M ) Ro x V C m + H • r re i x v re i 

= (m E + rriM)coc M • ' h-CM + Lo 

= Lcm + L 0 . (13.33) 

Similar as in (13.32), here it also holds that ls-CM is a vector normal to the orbital 
plane defined by the sun and the center of gravity. 

— (R e x F E s + R m x Fm,s + (R e R m) x F em) = 0 ; 

this means 

Lcm = —Lo- (13.34) 

We now consider the resulting torque Mo with respect to the center of mass CM: 

Mo = r E x (F E s + F em wie^cm) f^x (F ms + F me ^m®cm) 

= r/, x F es + I’m x Fm,s- 

The second terms in each bracket drop because the force and position vector have the 
same orientation. The third two terms cancel because 

r E m E = -r M m M - 


By inserting r re i, it follows that 

mm m e 

Mo = -;-r re i X F ES ---rrei X F MiS . 

m e +m M m e -\-m m 

Using (13.27) for the two force vectors F^s and F ms, one obtains by simplifying 
Mo with respect to cylindrical coordinates: 


M 0 = 


3 yM E M 

Rl 


(A7? r £ , (r re i x e r )). 


(13.35) 


Here, A R^e and A R zE were set to zero, according to the definition of the problem. In 
order not to complicate the problem unnecessarily, we put the coordinate system at the 
center of gravity and thereby also that at the origin just so that the angular momentum 
on the average lies in the x, z-plane. This approach is justified since the precession 
frequency to be calculated is notably less than &>cm- During one revolution about the 
sun the angular momentum has changed only insignificantly (by about 20°), so that 
the ecliptic of the earth-moon system has turned only slightly. 
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Fig. 13.26. (a) The center-of- 
mass motion about the sun is 
parametrized by the angle y. 
(b) The ecliptic plane is tilted 
by a from the plane spanned 
by the center-of-mass motion 
about the sun 


Ansatz: 


cost 


r' el = ro( sin/3 |, where /3~coMt. 
0 


In the nonprimed system the vector has the components 


'cos a 0 — sin a N 

Trel = | 0 1 0 

sin a 0 cos a 


Ansatz: 

( cos(y +(po)' 
sin (y + cpo) 

0 

Using the relation 

m M 


r rel = r 0 


' cos a cos /3 N 
sin /3 
sin a sin 


where y ~ cocMt- 


(l*rel ’ C r ), 


A R r E = 

me + mM 

one obtains, by using the two approaches, the following new formulation of (13.35): 

3 ym E M 2 m M 

M o = —rr— r o- v 


RZ 


m E +mm 


with 


/ — sin a cos a cos 2 /3 sin(]/ + <^o) cos(y + <po) — sin a cos/3 sin/3 sin 2 (y + ^o)\ 

^ _ sin a cos a cos 2 /3 cos 2 (y + cpo) + sin a cos /3 sin sin(y + cpo) cos (y + cpo) 

cos 2 a cos 2 /3 • sin()/ + cpo) cos(y + cpo) — sin /3 cos /3 cos a cos 2 (y + <po) 

{ + cosacos/3 • sin/3 sin 2 (y + <^o) — sin 2 /3sin(y + <po)cos(y + <^o) ) 

This clumsy expression can be significantly simplified, assuming again that cop ^ 
&>CM- This means that the angular momentum Lo changes only slightly during one 
revolution about the sun. 

We first consider The revolution period of the moon about the earth is about 
28 days. The moment Mo changes its orientation with varying for the “inert” angu¬ 
lar momentum, however, only the average momentum (Mq)/ 3 counts; this is obtained 
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by averaging over a full period of /3. The moment impact (analogous to the force 
impact) of Mq and of (Mo)/? has the same value, because of the linearity /? = a>Mt. 


(M 0 )p = 


3yM/ji 


R t 


— sin a cos a - sin(}/ + (po ) cos(y + (po ) 

1 ? 

sin a cos a - cos (y + cpo) 

2 1 1 

y cos ol — sm(]/ + (po) cos(y + <po) - - sm(y + ^ 0 )cos(]/ + cpo) J 


The same consideration can be made for the “rotating” angle y ~ oocMt, since we 
assume that co p coqm- We therefore average over (Mo)/?: 


((Mo )P)y = 


3yMfjL 

- 


sin a cos a 


3 yMji 2 1 . ... . 

Tq • -sina cos a • := (Mo). 


$ 0 4 


(13.36) 


Not very much is left over from the extended expression; the resulting acting moment 
(Mo) is exactly perpendicular to Lo (see Fig. 13.27) and points along the y-direction. 
If the angular momentum moved (slowly), we imagine the shifted angular momentum 
as being embedded in a fixed coordinate system and thus get, according to (13.36), 
the same result. (Mo) is constant and always perpendicular to Lo. It therefore causes 
a precession. 


Fig. 13.27. Meaning of the 
angles a, ft, y at one glance: 
P describes the rotation of the 
earth-moon axis in the primed 
coordinate system 



But first we will illustrate (13.36) that has been obtained in a rather mathematical 
way: In this situation, we get a maximum moment. The moment M(/?) now points in 
the y'-direction for all possible /?. For ft = 90° or ft = 270°, the vector M vanishes. 
On the average, we thus obtain 


(M(/?))=2(M 0 ). 
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Fig. 13.28. Orientation of e r 
as a function of y + (Pq: 
The averaged moment (M)/3 
always points along the y'- 
direction 


The diagram shows that there are also two “maximum” orientations with respect to 
(y + <Po). Between these positions (M(/3)) must vanish. One thus obtains altogether 

((M(/3, y))p)y = (M 0 ). 

Thus, the result (13.36) is also clearly understood. 

With (13.32) in our defined coordinate system, we have 



L 0 = /x • (d m • 


The angular momentum Lo now precesses: 
Ms (Mo) 

(D p = - = - 

L • sina Lo • sin a 

3 yMsIjltq sin a cos a 3 yMs 


4 R 3 


COS O' 


4 R^/icdm ?q sin a 

It has been shown at the beginning (13.28) that 

yM s 


1 

cdm 


Y M S _ 2 r, 

~r — ^cm^o 

R, 


0 

and with 


Ln 


J CM 


-& 


(dm ■ 


27T 

Tm ’ 


(13.37) 


one gets 


(D p = - cos a j 


'4tH 


*CM 


Tm 3 Tm 
= - cos a — 


27T 


z CM 


and for T n 


2;r 4 

T n = — = - • 


1 


CM 


3 cosa Tm 


With Tcm ~ 365.25 days, Tm ~ 27.3 days, and o' ~ 5.5°, one obtains 
T p ~ 17.9 years. 


(13.38) 


(13.39) 


(13.40) 
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Fig. 13.29. Demonstration of 
the noncommutativity of finite 
rotations 


The fact that the actual Saros cycle is larger by about 2% is partly due to the ap¬ 
proximation when averaging y (the angular momentum actually moves slightly), but 
possibly due to the elliptic path of the moon about the earth. In any case, the result is 
relatively accurate, considering the approximations made. 

From (13.34), it further follows that 


Lcm = — Lo> 


i.e., the “large” angular momentum vector Lcm runs through an opposite precession 
cone. 


13.6 The Euler Angles 

The motion of the heavy top suspended at a point can be described by specifying 
the orientation of a body-fixed coordinate system (x r , y ', z') relative to a space-fixed 
system (x, y, z). The two coordinate systems have a common origin at the suspension 
point of the top. To establish the relation between the two coordinate systems, one 
usually adopts the Euler angles . 8 

The coordinate system (x', y', z') is obtained from the system (x, y, z) by three 
subsequent rotations about defined axes. The corresponding rotation angles are called 
Euler angles. The sequence of the rotations is important, since rotations by finite an¬ 
gles are not commutative. In Fig. 13.29 we see at once that a permutation of the se¬ 
quence of two rotations about different axes leads to a different result. 



A rotation by 90° about the x-axis, followed by a 90°-rotation about the y-axis 
(upper figure) leads to a different result than rotating first about the y-axis and then 
about the x-axis (lower figure) (noncommutativity of finite rotations). 

The Euler angles are defined as follows: The first rotation is performed about the 
z-axis by the angle a. The x- and y-axis turn into the X- and T-axis. The Z-axis 


8 Leonhard Euler , Swiss Mathematician, b. April 15, 1707, Basel, Switzerland-d. September 18, 
1783, Saint Petersburg, Russia. Son of a clergyman, Euler studied mathematics in Basel with Johann 
Bernoulli, and in 1727 was appointed professor of physics and mathematics at the university of Saint 
Petersburg. With an extended break from 1741 to 1766 as a member of the Berlin Academy of Sci¬ 
ences, he spent the rest of his life in Saint Petersburg. Euler authored more than 800 scientific papers 
on nearly all subjects in mathematics, especially on calculus, calculus of variations, and the theory 
of complex functions. He contributed eminently also to the theory of numbers, and his solution to 
the problem of the Seven Bridges of Konigsberg laid the foundation of graph theory and topology. In 
physics, Euler worked mainly in hydrodynamics, the theory of elasticity, and the theory of the top. 
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coincides with the z-axis. The so defined X, Y, Z system is a. first intermediate system 
which is only used to keep the calculation transparent. For the unit vectors, we have 

i = (i • 1)1 + (i • J)J + (i • K)K = cosed — sinaj, 

j = (j 1)1 + (j J)J + (j K)K = sin cd + cosaj, (13.41a) 

k = (k 1)1 + (k J)J + (k K)K = K. 

The second rotation is performed about the (new) X-axis by the angle /3; the Y- 
and Z-axis turn into the Y' - and the Z'- axis. The Z'-axis coincides with the Z-axis. 
The X' ,Y', Z' system fixed in this way is a second intermediate system. It serves for 
mathematical clarity and transparency, just as the first intermediate system did. An 
analogous calculation yields for the unit vectors 

1 = 1 ', 

J = cos J' - sin/3K', (13.41b) 

K = sin /3J' + cos /3K'. 

The third rotation is performed about the Z'-axis by the angle y; the X '- and Y’~ 
axis then turn into the x’- and /-axis, respectively. The z'-axis is identical with the 
Z'-axis. The x', /, z' system constructed this way is the desired body-fixed coordinate 
system. For the unit vectors, one obtains 

I' = cos y i' — siny j', 

J' = sinyi' + cosyj', (13.41c) 

K' = k'. 


Fig. 13.32. All three Euler an¬ 
gles 


x’ 


Using the relations between the unit vectors, we now determine the unit vectors 
i, j, k as functions of i', j', k'. For this purpose, we insert 

i = cosed — sin a J 

= cos cd' — sin a cos f j' + sin a sin /3K f 
= cos a cos y i' — cos a sin y j' — sin a cos f sin y i' 

— sin a cos f cos y j' + sin a sin f> k' 

= (cos a cos y — sin a cos f sin y )i' 

+ (—cos a siny — sinacos^cosy)]' + sin a sin/3 k'. 


z, Z 




(13.41d) 





240 


13 Theory of the Top 


An analogous calculation yields 

j = (sin a cos y + cos a cos /3 sin y) F 

+ (— sin a sin y + cos a cos/I cos y)f — cos a sin /3k r , (13.41e) 

k = sin/? sin y F + sin/3 cos y j r + cos pk f . 

The rotations can also be expressed by the corresponding rotation matrices. For the 
first rotation, we have 

r = AR, 


where 



/ COS O' 

— sin a 

°\ 

A = 

I sin a 

COS O' 

0 


V 0 

0 

i/ 


and R = 



The matrices for the rotations by the angles /3 and y accordingly read 

- f 1 0 0 ) 

5= 0 cos/3 — sin P I , 

\0 sin/3 cos/3 / 


( cos y — sin y 0 \ 

siny cosy 0 I . 

0 0 l) 

The matrix of the entire rotation D is the product of the three matrices D = ABC. 
Hence, 

r = Dr' or r' = D~ l r = Dr. 

Since the rotation matrices are orthogonal, the inverse matrix equals the transposed 
one. By calculating the matrix product, one can easily show that the matrix D agrees 
with the relations derived for the unit vectors. (This agrees with the general consid¬ 
erations from Chap. 30 of Classical Mechanics: Point Particles and Relativity of the 
Lectures on Theoretical Physics.) 

We first calculate the angular velocity oo of the top as a function of the Euler angles. 
If (i, j, k) define the laboratory system and (F, j r , k r ) a body-fixed system of principal 
axes, for the angular velocity we have 

oo = cDqi k T cop IT (Dylfc! — dk T P IT y K 7 , 

where we presuppose that k, I, and K' are not coplanar. We utilize the derived relations 
between the unit vectors and obtain 

oo = a sin /3 sin y i' + a sin /3 cos y j 7 + a cos p k f + /3 cos y F — /3 sin y j r + yk f 
= (a sin P sin y + P cos y)F + (a sin P cos y — P sin y) j r + (a cos P + y) k r . 
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Setting co = oy i' + coy f + coy k 7 , we get the components of the angular velocity rela¬ 
tive to the body-fixed coordinate system: 

coy = co\ = a sin p sin y + p cos y, 

coy'= 02 = a sin cos y — sin y, (13.42) 

COy = 02=01 cos p + y . 


The kinetic energy T of the top is then 

T = — (0 1 o 2 + @ 2^2 © 3 ^ 3 ) 

1 . 9 

= -©1 (a sin/3 sin y + ft siny) 

1 . 2 

+ -© 2 (<* sin p cosy — P siny) 

+ -© 3 ( 0 : cos P + y) 2 . (13.43) 

If © x = © 2 , i.e., the top is symmetric, the above expression simplifies to 

T = ^©i(a 2 sin 2 P + p 2 ) + ^ ©3 (o' cos P + y ) 2 . (13.44) 


13.7 Motion of the Heavy Symmetric Top 

For the special case of the heavy symmetric top, we will determine the explicit equa¬ 
tions of motion and the constants of motion, starting from the Euler equations. For 
simplification, we note that for the symmetric top the two orientations of the princi¬ 
pal axes ey, ey can be arbitrarily chosen in a plane perpendicular to ey. We therefore 
choose a coordinate system where the angle y always vanishes. This system is then 
no longer body-fixed (it does not rotate with the top about the e z /-axis). The axes 
ey, e z , ey are then coplanar, as are e x ,e x r, e y . This is illustrated in Fig. 13.33. 



Fig. 13 .33. Heavy top in vari¬ 
ous coordinate systems 


X 









242 


13 Theory of the Top 


Analytically, this follows from the fact that 

e' x = t = I = cosed + sin a j = cosa e x + sin a e y , 

e f y = J = cos /3 J + sin /3 K = cos /3(— sinai + cosa j) + sin/3 K 

= — sinacos^e* + cosacos /3e y + sin /3e z , (13.45) 

e' z =K' = (— sin/3)J + cos/IK = — sin/3(— sin cd + cos a j) + cos/I K 
= sin a sin p e x — cos a sin p e y + cos f3 e z . 



Fig. 13.34. Precession and nu¬ 
tation of the angular momen¬ 
tum: The figure axis points 
along Ly 


We thereby have inverted the relations (13.41a) and (13.41b). By means of the expres¬ 
sions for e^., , e ' z , one can now easily check the triple scalar products < • (e- x e y) 

and e x ■ (e' x x e y ), e.g., 


1 0 0 \ 

cos a sin o' 0 1=0. 

0 10 / 

Similarly, one shows the vanishing of the other triple scalar product and thus confirms 
that the corresponding vectors are coplanar. 

The coordinate system thus follows the precession (with a) and the nutation 
(with P) of the top, but not its eigenrotation. To realize that P describes the nutation, 
we note that a nutation motion of the figure axis is superimposed onto the precession 
(compare the discussion in the section “Elementary considerations on the heavy top”). 
This manifests itself for p in an up-and-down motion (vibration) about a fixed value 
Po (see Fig. 13.34). 

For the angular velocities (13.42) of the ey,ey,ey system (which is only partly 
body-fixed) relative to the laboratory system e x , , e z in this system (y = 0) we have 


e* • (e' x e v ) = det 


= co x > = $, 

a >2 = coyf = a sin/3, (13.46a) 

( 02 , = coy =a cos p, 


or 


co = /3ey + a sin pey + a cos p ey. (13.46b) 

The angular velocity of the top, on the other hand, is 

(Ok = coyey + coyey + (coy + co o)e z / 

= pey -basin /ley + (a cos P + &>o)ey. (13.47) 

Here, coo is the additional angular velocity of the top relative to the e x r , ey , ey system. 
The angular velocity coo(t) in general depends on the time. We must take care also 
when calculating the angular momentum, because in this particular ey , ey , ey system 
the rigid body still rotates with the angular velocity cooey. We can call this additional 
rotation spin. It is due to the particular choice of our (not exactly body-fixed) system 
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e x f , ey, e z f . The angular momentum is then 

L = (r)co/ x = ®i(L>ie x r + © 2 &> 2 Cy + ©3(^3 + &>o )ey 
= © 1 /?ey + © 2 d sin /? ey + ©3 (a cos ft + cuo)ey 

and the Euler equations read 
Lllab = Lie' +o> X L = D. 

Note that the spin rotation coo appears in L but not in co ! 

The torque about the origin of the space-fixed system is 

D = (/ • ey) x (— mge z ) = mgl sin ft ey , 

because e' x e z = — sin p e! x , as is easily seen from (13.45). By inserting this into the 
Euler equations (13.5) and (13.49) and noting that ©1 = © 2 , we obtain 

mgl sin p = + (©3 — ©i)d 2 sin P cos P + © 3 &>od sin p, 

0 = ©i(asin/l + aft cos P) + (©1 — © 3 )d/l cos P — © 3 ^ 0 P, (13.50) 

0 = ©3 (a cos P — dtp siny 0 + cbo). 

From the above system of equations, a(t), P(t ), and coo(t) can be determined. From 
the third equation, we have, because ©3 ^ 0 , 

d 

a cos p — ap sin p + cbo = —(dcos p + coo) = 0 (13.51a) 

dt 

or 

d cos + wo = A = constant, (13.51b) 

i.e., the angular momentum component Ly = ©3 A (see (13.48)!) about the figure axis 
is constant. 

We therefore set a cos P + coo = A, calculate coo from this and insert it into the first 
two equations. We then obtain two coupled differential equations for precession ( a ) 
and nutation ( P ), respectively: 

mgl sin P = © 1 P — ©id 2 sin/? cos P + © 3 A sin p • a , (13.51c) 

0 = ©i(dsin P + 2d/?cos/?) — ©3 Ap. (13.5 Id) 

We first investigate this system for the case that the top performs no nutation. Then 
P = P = 0, and P > 0. By insertion, we obtain 

mgl = — ©id 2 cos/3 + ©3 Ad, a = 0. (13.52) 

The second equation means that the precession is stationary. 

From the first equation, we determine the precession velocity d: 

© 3 A / / 4mg/©i cos/? \ 

2 ©! cosp y Y © 2 A 2 J 


(13.48) 

(13.49) 


a = 


(13.53) 






244 


13 Theory of the Top 


For a top rotating quickly about the e z /-axis, A becomes very large, and the fraction 
in the radicand becomes very small. We terminate the expansion of the root after the 
second term and get as solutions to first order for d sma ii: 

mgl . 

Ismail — ~ , > (13.54a) 

@ 3 a 

to zeroth order for di ar ge> we have 


a = 


6,3 A 
©1 COS /3 


(13.54b) 


A stationary precession without nutation (regular precession) occurs only if the heavy 
symmetric top gets a certain precession velocity («smaii or charge) by an impact. In the 
general case the precession is always coupled with a nutation. The heavy top will al¬ 
ways begin its motion with an deviation toward the direction of the gravitational force, 
i.e., with a nutation. We still note that e-mail agrees with the precession frequency 


mgl mgl mgl 
P L ©3&>0 ©3 A 


obtained in the section “The heavy symmetric top: Elementary considerations.” 

Before we continue the discussion on the general motion of the top, we determine 
additional constants of motion. We have already seen that from the last equation of the 
system (13.50) we have 


a cos /3 + coo = A = constant, 


(13.51b) 


hence, the corresponding part of the kinetic energy is 
1 9 1 9 

73 = -©3(0? cos/3 + cooj = -©3 A 2 = constant. (13.55) 

Multiplying the first of the Euler equations (13.50) by ft and the second one by a sin/3, 
after addition one gets the total differential 

mgl sin/3 • ft = ©i/3/i + ©i(ad sin 2 /3 + a 2 /3 sin/?cos/0 


or 

3- ( mgl cos /?) = + l©ia 2 sin 2 ^. (13.56) 

This means that the energy (more precisely, the sum of the kinetic parts T\ + T 2 plus 
the potential energy) 

E' = 1 ©! 0 2 + a 2 sin 2 p) + mgl cos p (13.57) 

is also a constant of motion. This must be so, of course, since the total energy of the 
top must be constant. 

The total energy of the top is then 

e = e' + r 3 

= -©i(/3 2 + a 2 sin 2 j3) + -© 3 (d cos /3 + coo) 2 + Mgl cos j3. 


(13.58) 
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The last term obviously describes the potential energy of the top in the gravitational 
field. The energy law (13.58) must of course hold in general. We could have written 
it down immediately and skipped the derivation (13.56) from the Euler equations. 
Nevertheless, it is of interest to see how the equations of motion succeed too. 

In the second Euler equation (13.5Id), we insert 

Ly = ©3 A = constant (13.59) 

and multiply by sin p. This yields 

©i(d sin 2 P + 2a P sin ft cos ft) — Ly sin/3 • ft = 0. (13.60) 

Since Ly is constant, this is a total differential, and then it follows that 

©1 (a sin 2 /3) + Ly cos /3 = constant. (13.61) 

This constant is the z-component of the angular momentum in the space-fixed system. 
This is seen immediately if we multiply the angular momentum 

L = &\{coyey -\- &>yey) L z /e z / (13.62) 

by e z . From (13.45) or from Fig. 13.33, we see that 


e x f • e z = 0, ey • e z = sin y 3 , ey • e z = cos ft. (13.63) 

By noting that coy = a sin ft, we see that 

L • e z = L z = ©id sin 2 /3 + Ly cos ft = constant. (13.64) 

The two angular components L z and Ly are constant, because the moment of the grav¬ 
itational force acts only in ey -direction, i.e., perpendicular both to the z- as well as to 
the z 7 -axis. The conditions L z = constant and Ly = constant 7 can be realized by a pre¬ 
cession of L about the z-axis, and an additional rotation of the z '-axis about the L-axis. 
The latter motion is the nutation. This obviously means that the angular momentum L 
precesses about the laboratory axis e z , and the figure axis ey simultaneously performs 
a nutation about the angular momentum L. 

With the constants of motion we will now further discuss the motion of the top. 
From the equation of the angular momentum component L z in the laboratory system, 

©id sin 2 /3 + Ly cos /3 = L z , (13.65) 


we determine a : 

L z — Ly cos /3 

a = - 3 -, 

©1 sin 2 P 

and insert this into (13.58): 


(13.66) 


1 

2 




+ 


(Lz ~ cos /?) 2 

201 sin 2 /} 


+ T 3 + mgl cos P 


E. 


Since L z , Ly , 73 , and E are constants of motion, this is a differential equation for the 
nutation p(t). We now substitute 


u = cos p, 


(13.67) 
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then u = — sin /3 • /? and sin 2 /3 = 1 — u 2 . From this, we get 
1 


-©i 


u 2 


+ 


(L z - L z iu) 2 


2 1 -u 2 2@i(l — u 2 ) 


+ mglu = E — T 3 


or 


. 2 (L z — Lyu) 2 2mglu(\ — u 2 ) 2(1 — u 

" + if + -el— = — 

With the abbreviations 

, E - 73 2 mgZ L z 

s — 2 -, t =-, y = —, 

©1 ’ ©1 ^ ©1 

this equation can be written as follows: 
u 2 — (s — %u)( 1 — u 2 ) — (y — 8u) 2 . 


2 ) 


(E-T 3 ). 



(13.68) 


(13.69) 


(13.70) 


(13.71) 


It cannot be solved by elementary methods. We therefore give the graphical rep¬ 
resentation of the functional dependence. In the following we use the abbreviation 
u 2 = f (u). For large u the leading term is w 3 , i.e., the curve approaches f(u) = §w 3 . 
For /(1) and /(—1), we have 

f(l) = -(y-S) 2 <0, f(-l) = -(y + 8) 2 <0. (13.72) 

From this, we obtain Fig. 13.35. 


Fig. 13.35. Qualitative trend 
of the function f(u) 



In general, the function f(u) has three zeros. Because of its asymptotic behavior 
for large, positive u and because /(1) < 0, for one zero we have u 3 > 1. 

For the motion of the top, we must have u 2 > 0. Since 0 < < tt/2, we have 0 < 

u < 1. To ensure that the top moves at all in the physically relevant region 0 < u < 1, 
in a certain interval of this region we must have u 2 = /(«)> 0. Hence, for physical 
reasons two physically interesting zeros u\, U 2 must exist between zero and unity. 
Therefore in the general case there are two corresponding angles /3\ and P 2 with 

cos/3i=wi and cos^2 = ^2- (13.73) 

In special cases, we can have (1) u\ =U 2 and (2) u\ = U 2 = 1. We first consider these 
special cases: 

(1)^1 = U 2 2 / 1 1: The tip of the figure axis orbits on a circle (this is called stationary 
precession ); no nutation occurs (the angle /3 has a fixed value). According to (13.66) 
the precession velocity reads 
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a = 


y — 8u 
1 — u 2 


(13.74) 


and is constant. 

(2) u\ = U 2 = 1: In this case, the figure axis points vertically upward. The top 
performs neither nutation nor precession motion (,sleeping top). This is obviously a 
special case of the stationary precession (compare Exercise 13.8). 

In the general case ( u\ ^ U 2 ), a nutation of the top is superimposed on the preces¬ 
sion between the angles P\ and fo. According to the angular momentum law (13.65), 
(13.66) for the precession velocity, we have 


L z — L z f cos p y — 8u 
©i sin 2 P 1 — u 2 


(13.75) 


The zeros of this equation, i.e., the solution of a(u) = 0, specify those angles ft at 
which the precession velocity a momentarily vanishes. In order to illustrate the gyro¬ 
scope motion, we give the curve described by the intersection point of the figure axis 
on a sphere centered about the bearing point. 

There are three different types of motion, as illustrated in Figs. 13.36, 13.37, 
and 13.38. 


( 1 ) y/8 = U 2 : The precession velocity just vanishes at P 2 ’, hence, the peaks appear. 

(2) u\ < y/8 < U 2 ‘ The upper peaks at ft 2 extended to loops. The precession velocity 
vanishes between P 2 and p >\. 

(3) y/8 > U 2 : The precession velocity would vanish beyond P 2 (as indicated in 
Fig. 13.38). A peak cannot arise. 


EXAMPLE 





13.10 The Sleeping Top 

In the case of the so-called “sleeping top,” the figure axis points up vertically, so that 
neither nutation nor precession occurs. For this special case, we must have ft = 0 and 
p = 0. 

From energy conservation, we obtain 

-Oi(l3 2 + d 2 sin 2 P) + -© 3 A 2 +mgl cos P = E, (13.76) 

and because 

p = 0 , p = 0 , 
it follows that 

@ 3 A 2 = 2(E -mgl). (13.77) 

Constancy of the z-component of the angular momentum yields 
©id sin 2 P + ©3 A cos P = constant = K, 


(13.78) 
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Fig. 13.39. 


from which follows (A = co 3 + coo = constant) 


® 3 A = K. 


(13.79) 


For the quantities e, §, y, and 5 in the differential equation for the nutation motion in 
u = cos p, we have 


_ 2(E- (1/2)03 A 2 ) _ 2mg/ 

©1 ©1 

_ 2mgl 

S 0 ? 

©1 

/T © 3 A 

k= © 7 =_ ©r’ 

©i 

=>> £ = § and y =8. 

Inserting this into the differential equation (13.71) for u, one obtains 
u 2 = f(u) = e(l — u)( 1 — w)(l + u) — y 2 ( 1 — u) 2 
/(«) = (1 - w) 2 [e(l + u) - y 2 ]. 


(13.80) 


(13.81) 

(13.82) 


Equation (13.82) has a twofold zero, which will be denoted by U 2 = u 3 = 1 or u\ = 
U 2 = l (compare Fig. 13.39). The third zero is at 

v 2 ©?A 2 

1 = ^- 7-1. (13.83) 


©i2mg/ 

Accordingly, f(u) has one of the two courses (see Fig. 13.39). 




For Fig. 13.39(a), p actually vanishes since f(u) has a zero. Thus, we can have 
P = constant / 0, i.e., the case of stationary precession. But since we also require 
P = 0 for the “sleeping top,” only Fig. 13.39(b) is left over where P / 0 does not exist 
as a solution ( u\ > 1 ). 

Hence, from (13.83) we obtain as condition equations for the “sleeping top”: 


©fo 2 

©i 2 mgl 


a 2 > 4mg/Qi 
- © 2 


(13.84) 


Equation (13.84) will be satisfied only in the initial phase of the gyroscope motion. 
Because of friction, A 2 = ( co 3 + coq ) 2 decreases, so that 


A 2 < 


4mg/©i 


©3 
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and therefore one observes precession with overlaid nutation. Further energy loss in¬ 
evitably causes the top to tilt down. 


EXAMPLE 

13.11 The Heavy Symmetric Top 


(a) Write the total energy of the top as a function of the Euler angles. 

(b) Determine the constants of motion, and use them to eliminate the Euler angles 
a and y from the energy law. Propose approaches for solving the resulting one¬ 
dimensional differential equation: 

E= l -®xm 2 + VM. (13.85) 

(c) Discuss the effective potential V Q ff(P), and solve the differential equation of the 
heavy top for infinitesimal displacements from the stable position in the minimum 
of the potential: 

P (0 = A) + *7 (0 • 


Fig. 13.40. 


8 


We consider the symmetric top bound to a fixed point in the gravitation field. The 
energy law reads 

E = T + V, (13.86) 

where 

T = and V = M-g-h. (13.87) 

i 

M is the mass of the top, and h = l cos /3 is the distance between the center of gravity 
and the bearing plane. 

In order to express the angular velocity co = (a) x f, coy, coy) by the Euler angles and 
their time derivatives, we note that a, ft, y are rotation velocities by themselves. The 
rotation velocity of the body is obtained as the vector sum 

co = co a + (dp + (Oy = oit a + /3ep + yGy . (13.88) 
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The vectors e y , e a , follow from the definition of the Euler angles: 
y: Rotation about the new (body-fixed) z'-axis: 

e y = ey = (0,0,1). (13.89) 

or. Rotation about the space-fixed z-axis: 

e a = e z = (sin /3 sin y, sin /3 cos y, cos /3). (13.90) 

fi: Rotation about the nodal line, x-axis: 

=e x = (cosy, — siny,0). (13.91) 

One thus obtains the components of the rotation velocity in the body-fixed coordinate 
system 

coy =ooi = a sin /3 sin y + /3 cos y, 

coy = co 2 = a sin /3 cosy — /3 sin y, (13.92) 

coy = C 03 = a cos /3 + y. 


By inserting this into (13.87) and using ©1 = © 2 , we obtain for the kinetic energy 

T = i© 1 (d 2 sin 2 /? + p 2 ) + l© 3 (dcos^ + y) 2 . (13.93) 

Since the gravitational force acts only along the z-direction, the torque acts only along 
the nodal line ep. 

D = r x F = —Mgley x e- 

= —Mgl sin Pep. 

Thus, the angular momentum components in the e z ,ey -plane remain unchanged. 

L z and Ly are constants of motion: 

Ly = ©3603 = © 3 (d cos /3 + y) = constant, 

L z = L • e z = constant. 

We evaluate the scalar product in the body-fixed coordinates: 


L z — L • e z 

= ©i(d sin/3 siny + P cos y)(siny sin/3) 

+ ©i(a sin P cosy — P siny)(cosy sin/3) 

+ © 3 (d cos /3 + y)(cos /3) 

= ©i(dsin 2 ^) + ©3COS/3(dcos/l + y). (13.95) 

Here, we utilized (13.92). Equations (13.94) and (13.95) can be inverted, i.e., solved 
for y and a . 


a = 


L z 


y = L z' 


— Ly COS /3 
©isin 2 /3 

/j_ c °t 2 p \ 

V ©3 ' ©1 ) 


L z cos/3 


(13.96) 


©l sin 2 /I 


(13.97) 
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We insert the relations (13.96) and (13.97) obtained this way into the expression for 
the kinetic energy (13.93) and obtain 


*=2 0 ^ + T 


©1 f L 7 — L 7 t cost 


©i sin 2 0 


sin 


+ - ©3 ( a 2 cos 2 0 + y 2 + 2ay cos 0) + Mgl cos 0 

1 . 9 1 (L z — Lycos0 ) 2 \ L z f 

= ? ©l /* 2 + . 2 . + + Mgl COS/5 

2 2 ©i sm z 0 2 ©3 


(13.98) 


We used the constancy of L z and L z i to eliminate the two Euler angles a and y. 
This simplified the problem greatly. From the energy law (13.98) we can in principle 
determine 0(0 and then obtain a(t) and y(t) via (13.96) and (13.97). 

To proceed further, various possibilities offer themselves: 

(a) We can establish the equation of motion for 0(f): 


dE 

— =0 = 0100 + 
dt 


9^(0) A 
dp ^ 


Hence, energy conservation leads to the equation of motion: 


(13.99) 


®10 = -^Veff(0) 

dp 

1 9 cos P L 7 > 

= —(L z - L z /cos/ 6 ) 2 — -3— - . (L z -L z > cos ft) + Mgl sin ft 

©l siir ft ©l sin ft 

COS P '•y E 7 L M 

= A - 3 S L \ - 2L z L z ,cosft + L 2 ) - z , z + Mgl sin/l. (13.100) 

©1 sin -3 0 z ©l sin 0 


This is a one-dimensional differential equation for 0, although highly nonlinear. For a 
given solution Pit), a(t ), and y (f) can be found by integration of (13.96) and (13.97). 

(b) Another principal approach is to solve the differential equation (13.98) by sep¬ 
aration of variables and integration. 


f, 

P 

t -k)= / dp’ - 

J y2/0i(£-Vkf^')) 


(13.101) 


Thus, the time dependence of 0 can be determined by integration. 

Since (a) and (b) are likewise complicated, we restrict ourselves to a discussion of 
the effective potential, in order to understand the essentials of gyroscopic motion. 


Example 13.11 
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Discussion of the effective potential 


_ 1 (L z — L z ' cos/3) 2 

eff_ 2©i sin 2 ^ 


1 L,' 

+ 2 ©J +M S lcos P- 


(13.102) 


The effective potential is composed of three terms which we will discuss separately. 


Fig. 13.41. The effective po¬ 
tential is composed of three 
terms 



Spin term 


- —. (13.103) 

2 ©3 

The second term is constant and is due to the energy of the eigenrotation of the top 
about its figure axis. It shifts the zero point of the energy scale and is independent 
of /?. 

Angular momentum barrier 


1 ( L z — L z > cos /?) 2 
2©i sin 2 /? 


(13.104) 


The first term is understood by analogy to the l 2 /2mr 2 angular momentum term, 
which appeared in the effective potential when treating the central force prob¬ 
lem. It is positive and vanishes for L z /L z > = cos/?. Then /? is only a physi¬ 
cally meaningful angle if L z < L z >. This is in general fulfilled for a top with 
fast eigenrotation. For /? = 0, /? = 7T, the first term diverges because of the fac¬ 
tor sin 2 ^ in the denominator. As a consequence, the term then has a minimum 
which lies at /? = arccos (L z /L z r) < tt/2. For /3 —> 0 and /? —> n the potential rises 
steeply. 

Gravitation term 


Mgl cos p. 


(13.105) 
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The last contribution is caused by the gravitational potential. It is antisymmetric 
about the center tt/2 and shifts the minimum of the effective potential to the right side 
of arccos(L z /L z /) without changing its qualitative form. 

For a given energy E (> L z /2 ©3), the motion is restricted to the region 
E > Veff(^) with reversal points P±; these points are defined by E — VeffO^i)- 

For a more precise analysis of the motion, we determine the stationary solution for 
which P = fio = constant. It is located exactly in the minimum of the effective poten¬ 
tial, so that the reversal points P± coincide, and E = Veff(A))- A) is then determined 
from the minimum property: 


( 


aFeff 

dp 


= 0 . 

P=Po 


Hence, (13.100) leads to 


L 


2 

z 


2L Z L Z ' cos Po + L\, = 


E Z E Z ' sin Po 
COS Po 


©i Mgl sin Po 
COS Po 


(13.106) 


(13.107) 


This equation fixes po for given values of L z and L z r . 

Equations (13.96), (13.97) imply for d and y constant values do and yo. Hence, 
in dynamic equilibrium the top performs a constant rotation about its own axis 
y (t) = yot at fixed angle po , as well as a precession motion a(t) = do t with constant 
precession frequency do. 


Small oscillations about the dynamic equilibrium position 

In order to investigate the motion in the vicinity of Po, we consider small displace¬ 
ments from the equilibrium. Instead of explicitly solving (13.100), we write 


P(t) — Po + r]{t) 


(13.108) 


with an infinitesimal displacement r](t). We expand the potential into a Taylor series 

/dVeff\ 1 o /9 2 Veff\ 

VefK/J) = VefiKA) ) + >?Hdr +•••• < 13 - 109 ) 

V BP Jp =fSo 2 \ dt j p=fSo 


The linear term vanishes by construction, and the quadratic term follows by differen¬ 
tiation of the negative right side of (13.100): 


9 2 Vkf 

dp 2 


—Mgl cos P 


3 L Z L Z ' COS P 
©1 sin 2 P 


+ (L 2 z — 2L z L z r cos P + L 2 z ,) 


3 — 2 sin 2 P 
©i sin 4 p 


By inserting (13.107) for the last term, one obtains 


(13.110) 


9 2 Veff 

dp 2 


E Z E Z ’ 


© i Mgl (4 — 3 sin 2 Po) 


(13.111) 


Example 13.11 


P=Po 


©1 COS Po 
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Example 13.11 


For the total energy, one then obtains likewise 



+ Veff(A))- 


(13.112) 


Differentiation with respect to the time finally leads to the differential equation of the 
harmonic oscillator: 


rj + Q 2 r] = 0 
with 

= _ L^-0iMg/(4-3sin 2 ^o) 

©i V ) p=p 0 ©icos^o 

The corresponding motion 
P(t) = P o + r] 0 cos(Qt + Oq) 


(13.113) 


(13.114) 


(13.115) 


is stable if Q 2 > 0. Obviously the product L z Ly must be sufficiently large to ensure 
stable vibrations. 


Precession and nutation 

We insert the explicit solution of (13.113) into (13.96) and (13.97), and expand with 
respect to r](t): 


• u\ ~ Lz L z / cos Pq 9 (L z Ly cos/3 

a'At) ~^-1- ri(t )— -^- 

©1 sin 2 Po dp \ © x sin 2 P 

= d 0 + rj(t)ai, 

y{t) ~ yo + 


+ ■ 


P=Po 


(13.116) 


(13.117) 


where ao,ai,yo, yi are constants which depend on L z , Ly , and E (through Po). For a 
qualitative investigation of the superposition of nutation ( P(t )) and precession ( a(t )), 
we start from (13.116): 


a(t) — ao + r]ooi\ cos (Qt + (po ) 


= OC\T]0 


a o 

airio 


+ cos(Qt + cpo ) ). 


(13.118) 


For do/di? 7 o > 1, d always remains larger than zero (Fig. 13.42(a)). For do/dq?7o = 1, 
the precession frequency may become equal to zero (Fig. 13.42(b)). For ao/6(ir]o < 1, 
we have a backward motion in parts (Fig. 13.42(c)). 
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Fig. 13.42. 



(c) 


EXERCISE 


13.12 Stable and Unstable Rotations of the Asymmetric Top 


Problem. Use the Euler equations to show that for an asymmetric top the rotations 
about the axes of the largest and smallest moment of inertia are stable; however, the 
rotation about the axis of the intermediate moment of inertia is unstable. 

Solution. We start from the Euler equations for the free top: 


©2 — ©3 

0 ) 1 = ^ 0 ) 20 ) 3 , 

(13.119) 

©3 — ©1 

&>2 = ^ 0 )\ 0 ) 3 , 

©2 

(13.120) 

©1 — ©2 
©3 

(13.121) 


Let the top rotate about the body-fixed z-axis, i.e., 003 = coo = constant and co\ = C02 
= 0. To investigate the stability of the rotation about this principal axis, we tilt the 
rotation axis by a small amount, so that new components 8 co\ , 8002 and an additional 
8003 arise. For 8803, we have from the Euler equation 

©1 — ©2 

88)3 = — - -8coi8o) 2 - 0. (13.122) 

©3 

Neglecting quadratic small terms, we can set C 03 = coo. From the other two Euler equa¬ 
tions, we then obtain 


©3 — ©2 

8 o)\ -|- 8 o) 20 ){) = 0 , 

©1 

(13.123) 

©1 — ©3 

88 ) 2 + 8 o)\o)q = 0 . 

©2 

(13.124) 


To solve this coupled system, we use the ansatz 


8 co 1 = Ae lt , 
8 a ) 2 = Be Xt . 


(13.125) 
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Exercise 13.12 


This leads to a linear set of equations in A and B , where the determinant must vanish 
for nontrivial solutions: 


A 

© | - 0 3 
©2 


@3 — ©2 
©1 


-COO 


-COQ 


= 0 . 


(13.126) 


From this, we find the characteristic equation 


i z 

A = CO ( 


(©3 


© 2)(©1 
©1 ©2 


© 3 ) 


(13.127) 


For the rotation about the axis of the smallest moment of inertia ©3 < © 1 , © 2 , and 
for the rotation about the axis of the largest moment of inertia ©3 > © 1 , © 2 , equation 
(13.127) leads to a purely imaginary A: 


X 2 < 0, 


(13.128) 


and therefore to vibration solutions for 8 co\ and 8 gl> 2 - The rotation about the axis of the 
largest and smallest moment of inertia, respectively, is therefore stable. 

The rotation about the axis of the intermediate moment of inertia 


©1 > ©3 > ©2 or ©2 > ©3 > ©1 (13.129) 

leads to a real A and thus to a time evolution of 8 co\ and Sco 2 according to 

8 co \/2 = C 1/2 cosh Xt + Di/ 2 sinhAL (13.130) 

The rotation axis turns away exponentially from the initial position. The rotation about 
the axis of the intermediate moment of inertia is not stable! 
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In many cases, the motion of bodies considered in mechanics is not free but is re¬ 
stricted by certain constraint conditions. The constraints can take different forms. For 
instance, a mass point can be bound to a space curve or to a surface. The constraints 
for a rigid body state that the distances between the individual points are constant. 
If one considers gas molecules in a vessel, the constraints specify that the molecules 
cannot penetrate the wall of the vessel. Since the constraints are important for solv¬ 
ing a mechanical problem, mechanical systems are classified according to the type 
of constraints. A system is called holonomic if the constraints can be represented by 
equations of the form 

/*( ri,r 2 ,...,O = 0, k = 1,2,..., s. (14.1) 

This form of the constraints is important since it can be used for eliminating dependent 
coordinates. For a pendulum of length l (14.1) reads x 2 4- y 2 — l 2 = 0 if we put the 
coordinate origin at the suspension point. The coordinates v and y can be expressed 
by this equation. 

We already met another simple example of holonomic constraints in the context of 
the rigid body, i.e., the constancy of the distances between two points: (r z - — r j ) 2 — 
C 2 j = 0. In this case the constraints served to reduce the 3 N degrees of freedom of a 
system of N mass points to the 6 degrees of freedom of the rigid body. 

All constraints that cannot be represented in the form (14.1) are called nonholo- 
nomic. These are conditions that cannot be described by a closed form or by inequali¬ 
ties. An example of this type of constraint is the system of gas molecules enclosed in 
a sphere of radius R . Their coordinates must satisfy the conditions r z < R. 

A further classification of the constraint conditions is made based on their time 
dependence. If the constraint is an explicit function of the time, then it is called 
rheonomic. If the time does not enter explicitly, the constraint is called scleronomic. 
A rheonomic constraint appears if a mass point moves along a moving space curve, or 
if gas molecules are enclosed in a sphere with a time-dependent radius. 

In certain cases the constraints may also be given in differential form, for example 
if there is a condition on velocities, e.g., for the rolling of a wheel. The constraints 
then have the form 

N 

•",x N )dxj c = 0, (14.2) 

k 

where the Xk represent the various coordinates, and the a\ are functions of these coor¬ 
dinates. We now have to distinguish between two cases. 
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Fig. 14.1. A small sphere rolls 
on a large sphere 





Fig. 14.2. 


If (14.2) represents the total differential of a function U , we can integrate it imme¬ 
diately and obtain an equation of the form of (14.1). In this case the constraints are 
holonomic. If (14.2) is not a total differential, we can integrate it only after having 
solved the full problem. Then (14.2) is not suitable for eliminating dependent coordi¬ 
nates; it is nonholonomic. 

From the requirement that (14.2) be a total differential, one can derive a criterion 
for the holonomity of differential constraints. One must have 

> akdxk = dU with cik =-. 

V dXk 

This leads to 

dak 9 2 f/ dat 

dxi dxi dxk dxk 

Thus, (14.2) represents a holonomic constraint if the coefficients obey the integrability 
conditions 

dak d a i 

dxi dxk 

These only mean that the “vector” a = [a \, < 22 ,..., a^} must be rotation-free (irro- 
tational). In N -dimensional space, the situation is analogous. 

To classify a mechanical system, we additionally specify whether the system is 
conservative or not. 


EXAMPLE 


14.1 Small Sphere Rolls on a Large Sphere 

A sphere in the gravitational field rolls without friction from the upper pole of a larger 
sphere. The system is conservative. The constraints change completely after getting 
away from the sphere and cannot be represented in the closed form of (14.1), and 
therefore the system is nonholonomic. Since the time does not enter explicitly, the 
system is scleronomic. 


EXAMPLE 


14.2 Body Glides on an Inclined Plane 

A body glides with friction down on an inclined plane (see Fig. 14.2). The inclination 
angle of the plane varies with time. The coordinates and the inclination angle are 
related by 

- — tan cot = 0. 
v 
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Thus, the time occurs explicitly in the constraint. The system is holonomic and rheo- 
nomic. Since friction occurs, the system is furthermore not conservative. 


EXAMPLE 


14.3 Wheel Rolls on a Plane 


An example of a system with differential constraints is a wheel that rolls on a plane 
without gliding. The wheel cannot fall over. The radius of the wheel is a. 



Fig. 14.3. 


For the calculation, we use the coordinates xm , yM of the center, the angle (p that 
describes the rotation, and the angle xj/ that characterizes the orientation of the wheel 
plane relative to the y-axis. 

The velocity v of the wheel center and the rotation velocity are related by the rolling 
condition 


v = acp. 


The components of the velocity are 


xm = —v sin xf/, 
y M ~ vcosx/s. 

By inserting v , we obtain 
dxM + a sin xjs • dcp = 0, 
dyM — cl cos x/s • dip = 0, 

i.e., a constraint of the type of (14.2). 

Since the angle xjs is known only after solving the problem, the equations are not 
integrable. Hence, the problem is nonholonomic, scleronomic, and conservative. 


If a body moves along a trajectory specified (or restricted) by constraints, there ap¬ 
pear constraint reactions that keep it on this trajectory. Such constraint reactions are 
support forces, bearing forces (-moments), string tensions, etc. If one is not especially 
interested in the load of a string or a bearing, one tries to formulate the problem in 
such a way that the constraint (and thus the constraint reaction) no longer appears in 
the equations to be solved. We have tacitly used this approach in the problems treated 
so far. A simple example is the plane pendulum. Instead of the formulation in Carte- 


Exercise 14.2 
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sian coordinates, where the constraint x 2 + y 2 = / 2 must be considered explicitly, we 
use polar coordinates (r, <p). 

The constancy of the pendulum length means that the r -coordinate remains con¬ 
stant and that the motion of the pendulum can be completely described by the angle 
coordinate alone. This procedure—the transformation to coordinates adapted to the 
problem—shall now be formulated more generally. 

If we consider a system of n mass points, then it is described by 3 n coordinates 
ri, r 2 , ..., r n . The number of degrees of freedom also equals 3 n. If there are s con¬ 
straints , the number of degrees of freedom reduces to 3n — s. The set of originally 
3 n independent coordinates now involves s dependent coordinates. Now the meaning 
of the holonomic constraints becomes transparent. If the constraints are expressed by 
equations of the form (14.1), the dependent coordinates can be eliminated. We can 
transform to 3 n — s coordinates q\, q ^,..., q?>n-s that implicitly incorporate the con¬ 
straints and that are independent of each other. The old coordinates r, are expressed 
by the new coordinates qj by means of equations of the form 


ri =ri(qi,q2,...,q3n-s,t), 

*2 = T2(q\,q2, • • • , <73 n-s,t), 

(14.3) 

= ^n(q 1? q 2 i • • • » <73 n—si 0- 

These coordinates qt , which now can be considered free, are called generalized coor¬ 
dinates. In the practical cases considered here, the choice of the generalized coordi¬ 
nates is already suggested by the formulation of the problem, and the transformation 
equations (14.3) need not be established explicitly. Using generalized coordinates is 
also helpful for problems without constraint conditions. For instance, a central force 
problem can be described more simply and completely by the coordinates (r, d, cp) 
instead of the (x,y,z). 



As a rule, lengths and angles serve as generalized coordinates. As will be seen 
below, moments and energies etc. can also be used as generalized coordinates. 

EXAMPLE 


14.4 Generalized Coordinates 

An ellipse is given in the x,y-plane. A particle moving on the ellipse has the coordi¬ 
nates (v, y). 
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The Cartesian coordinates can be expressed by the parameter cp: 
y = bsirup, x=acoscp. 

Thus, the motion of the particle can be completely described by the angle ip (the 
generalized coordinate cp). 


EXAMPLE 

14.5 Cylinder Rolls on an Inclined Plane 

The position of a cylinder on an inclined plane is completely specified by the distance / 
from the origin to the center of mass and by the rotational angle <p of the cylinder about 
its axis. 

If the cylinder glides on the plane, both generalized coordinates are significant. 

If the cylinder does not glide, / depends on <p through a rolling condition. Only one 
of the two generalized coordinates will then be needed for a complete description of 
the motion of the cylinder. 


EXERCISE 


14.6 Classification of Constraints 


Problem. Classify the following systems according to whether or not they are scle- 
ronomic or rheonomic, holonomic or nonholonomic, and conservative or nonconserv¬ 
ative: 

(a) a sphere rolling downward without friction on a fixed sphere; 

(b) a cylinder rolling down on a rough inclined plane (inclination angle a ); 

(c) a particle gliding on the rough inner surface of a rotation paraboloid; and 

(d) a particle moving without friction along a very long bar. The bar rotates with the 
angular velocity co in the vertical plane about a horizontal axis. 

Solution, (a) Scleronomic, since the constraint is not an explicit function of time. 
Nonholonomic, since the rolling sphere leaves the fixed sphere. Conservative, since 
the gravitational force can be derived from a potential. 

(b) Scleronomic, holonomic, nonconservative: The equation of the constraint rep¬ 
resents either a line or a surface. Since the surface is rough, friction occurs. Therefore 
this system is not conservative. 

(c) Scleronomic, holonomic, but not conservative, since the friction force does not 
result from a potential! 

(d) Rheonomic: The constraint is an explicit function of time. Holonomic: The 
equation of the constraint is a straight line that contains the time explicitly; conserva¬ 
tive. 



Fig. 14.5. A cylinder rolls on 
an inclined plane 
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14.1 Quantities of Mechanics in Generalized Coordinates 

The velocity of the mass point i can according to the transformation equation 
r i =Yi{q\,..., q v , t) 


be represented as 

dr* dq\ 
dq\ dt 


r, = 


dr* dq v dr* 

+ ■■■-) - -—4 - 

dq v dt dt 


In the scleronomic case, the last term drops. The velocity can also be written in the 
form 


dr* . dr* 
r * = > — -q a + — 

at 


where q a = 


dq a 

dt 


(14.4) 


and q a denotes the generalized velocity. In the following, we restrict ourselves to 
the x-component. Moreover, we consider only the scleronomic case and write for the 
x-component of (14.4) 


Xi 


E dx* . 

a 9 <7“ 


(14.5) 


By differentiating (14.5) once again with respect to time, we obtain for the Cartesian 
components of the acceleration 


x* 


E d / dx* \ . oa* .. 

Tt\~^r.) qa + ~^r qa - 


' dt \ dq ( 


dx* 


d^o; 


The total derivative in the first term is written as usual: 


d / dx* \ v 
dt\dq a ) “ 


d 2 x* 


dqpdq a 


The index to be summed over is denoted here by the letter , to avoid confusion 
with the summation index a . Then we have 


*«• = H 


a,ft 


d 2 X* 

dqpdq c 


-qpqa 


E dx* .. 

7 qa- 
„ 9< ?a 


The first term involves double summation over a and /3. 

Let a system have the generalized coordinates q\,... ,qy that now shall be in¬ 
creased by dq \,..., dqy. We will determine the work performed by this infinitesimal 
displacement. For an infinitesimal displacement of the particle i , we have 


/ 


dti =£ 

a=l 

From this, we obtain the work performed: 


(14.6) 


/ 


dw = • dn = Fi • P- )dq a = 


i =1 


i =1 \a=\ 


dq<x 
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The components of the generalized force are thus obtained as the derivative of the 
work with respect to the corresponding generalized coordinate. 



D'Alembert Principle and Derivation 
of the Lagrange Equations 


15 


15.1 Virtual Displacements 

A virtual displacement 8r is an infinitesimal displacement of the system that is com¬ 
patible with the constraints. Contrary to the case of a real infinitesimal displacement 
Jr, in a virtual displacement the forces and constraints acting on the system do not 
change. A virtual displacement will be characterized by the symbol 8, a real displace¬ 
ment by d. Mathematically we operate with the element 8 just as with a differential. 
For example, 

8 sinv 

8 sinx =-8 x = (cosx)5x, etc. 

8 x 

We consider a system of mass points in equilibrium. Then the total force F \ acting 
on each individual mass point vanishes; hence, F/ = 0. The product of force and virtual 
displacement Fj • <5r z - is called the virtual work. Since the force for each individual 
mass point vanishes, the sum over the virtual work performed on the individual mass 
points also equals zero: 



(15.1) 


The force Fj will now be subdivided into the constraint reaction F z and the acting 
(imposed) force F^: 



(15.2) 


We now restrict ourselves to such systems where the work performed by the constraint 
reactions vanishes. In many cases (except, e.g., for those with friction) the constraint 
reaction is perpendicular to the direction of motion, and the product F z • 8 r vanishes. 
For instance, if a mass point is forced to move along a given spatial curve, its direction 
of motion is always tangential to the curve; the constraint reaction points perpendicular 
to the curve. There are, however, examples where the individual constraint reactions 
perform work, while the sum of the works of all constraint forces vanishes; thus, 


J2 F r S r,-=o. 
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The string tensions of two masses hanging on a roller represent such a case. We 
refer to Example 15.1. This is the proper, true meaning of the d’Alembert 1 principle: 
The constraint reactions in total do not perform work. We always have 

E F H r ‘=°- 

i 

This is the fundamental characteristic of the constraint reactions. One can, of course, 
trace this presupposition back to Newton’s axiom “action equals reaction,” as we just 
have seen in the example of the string tensions between two masses. But in general 
it does not follow from Newton’s axioms alone. The assumption that the total virtual 
work of the constraint reactions vanishes can be considered to be a new postulate. It 
accounts for systems of not freely movable mass points and can be expressed by the 
forces imposed on the system, as we shall see below (see (15.5)). Then the constraint 
position drops out from (14.2), and one has 


y>r<5r,=0. (15.3) 

i 

While in (15.1) each term vanishes individually, now only the sum in total vanishes. 
The statement of (15.3) is called the principle of virtual work. It says that a system is 
only in equilibrium if the entire virtual work of the imposed (external) forces vanishes. 
In the next chapter (equations (16.8) and (16.9)) the principle of virtual work (the total 
virtual work vanishes) will be established by the Lagrangian formalism. 

For holonomic constraints, the effect of the constraint reactions can be elucidated 
by the following: If we consider the i th constraint in the form 

g;(ri,r 2 , ...,rjv,0=0, 


then the change of gi with respect to a change of the position vector r j must be 
a measure of the constraint reaction F*.. on the jth particle due to the constraint 
gi (ri, r 2 ,..., !•#, t) = 0. We thus can write 


F z . 

ji 


= *i 


d&(ri,r 2 , ...,r N ,t) 
3r / 


--kiVjgi(ri,...,t). 


Here ki is an unknown factor, since the constraints gi (ri, r 2 ,..., , t) = 0 are known 

up to a nonvanishing factor. The total constraint reaction on the jth particle is then the 
sum over all constraint reactions originating from the individual k constraints; hence, 


F ;=En=E 1 - 


1 = 1 


i=1 


dgi(ri,...,r N ,t) 
3r / 


1 Jean le Rond d’Alembert , b. Nov. 16 or 17, 1717, Paris, as the son of a general-d. Oct. 29, 1783, 
Paris. D’Alembert, who was abandoned by his mother, was found near the church Jean le Rond and 
was brought up by the family of a glazier. Later he was educated according to his social status, 
supported by grants. He studied at the College des Quatre Nations, and in 1741, he became a member 
of the Academie des sciences. In mechanics, the d’Alembert principle is named after him; moreover, 
he worked on the theory of analytic functions (1746), on partial differential equations (1747), and on 
the foundations of algebra. D’Alembert is the author of the mathematical articles of the Encyclopedie. 






15.1 Virtual Displacements 


269 


The virtual work performed by all constraints is then 


k N 


8w = J2 ¥ )■ Sr J= Z! X! • • • *• 8r J 


j =1 <'=1 j =1 

k 

J2Xi8gi(r u ...,r N ,t), 

(=1 


where 


N d 

Sgi(ru ...,r N ,t) = - Sr,-. 

, dr j 

J = 1 


This is just the change of gi caused by the virtual displacements 8 yj . Since the virtual 
displacements are by assumption compatible with the constraints, i.e., the 8 rj satisfy 
the constraints, we must have 


8gi(ri,...,r N ,t) = 0. 

From this, we see immediately that 

• r ; - = 0 (15.4a) 

and therefore also 

N 

SW = Z F j - Sr J =0- (15.4b) 

j =i 

Hence, for holonomic constraints the constraint reactions are perpendicular to the dis¬ 
placements that are compatible with the constraints, and the virtual work of the indi¬ 
vidual constraint reactions vanishes. In Chap. 16, equations (16.8) and (16.9), we shall 
understand from a very general point of view that in the general case (hence including 
the case of nonholonomic constraints), the sum of the virtual work of all constraint re¬ 
actions must vanish. Therefore, JT • <5r*- = 0 always holds, while F^ • 8 rt = 0 holds 
only in special (holonomic) cases. 

The principle of virtual work at first only allows us to treat problems of statics. By 
introducing the inertial force according to Newton’s axiom 

Ff=p/, (15.5) 

D’Alembert succeeded in applying the principle of virtual work to problems of dy¬ 
namics as well. We proceed in an analogous way to derive the principle of virtual 
work. Because of (15.4a) and (15.4b) in the sum 


y>,-p,).Sr,=0, (15.6a) 

i 

every individual term vanishes. If we again subdivide the total force F i into the im¬ 
posed force F^ and the constraint reaction F^, with the same restriction as above we 
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*l ^2 


Fig. 15.1. Two masses on con¬ 
centric rollers: The string ten¬ 
sions and are parallel 
but have different magnitudes 


find the equation 

^(F“ — p,) • <5r, = 0, (15.6b) 

i 

where the individual terms can differ from zero; only the sum in (15.6b) vanishes. This 
equation expresses the d’Alembert principle. 

EXAMPLE 


15.1 Two Masses on Concentric Rollers 

Two masses m\ and m 2 hang on two concentrically fixed rollers with the radii R\ 
and R 2 . The mass of the rollers can be neglected. The equilibrium condition shall be 
determined by means of the principle of virtual work. 

For the conservative system under consideration (where no friction appears), the 
total work performed by the constraint reactions vanishes, i.e., 

r/=0. 

i 

In the present example, the constraint forces are the string tensions F * and . 

The vanishing of JT • <5r z - in the equilibrium state is equivalent to the equality 
of the torques imposed by the string tensions through the radii R\, R 2 ’ 

Di = R { F* = D 2 = R 2 F 2 Z . 

By means of the constraint, it follows with 8 z\ = R\ 8 (p, 8 z 2 = ~R 28 V, that 
F[ 8 z\ + F* 8 z 2 = (FfRi - F*R 2 )8<p = {D x - D 2 )8cp = 0. 

In the case of equal radii ( R\ = R 2 ), the string tensions are equal. 

From 

E F "-^= 0 ’ 

i 

it follows that 

m\g 8 z\ -\-m 2 g 8 z 2 = 0. 

The displacements are correlated by the constraint condition; we have 
8 z\ = R\ 8 (p, 8 Z 2 = -R 28 cp. 

Hence, we obtain 

(m\R\ — m 2 R 2 ) 8 cp = 0 


or 


m 1 R 1 = m 2 R 2 


as the equilibrium condition. 











15.1 Virtual Displacements 


271 


EXAMPLE 

15.2 Two Masses Connected by a Rope on an Inclined Plane 

In the setup shown in Fig. 15.2, two masses connected by a rope move without friction. 
The equation of motion shall be established by means of the d’Alembert principle. For 
the two masses, this principle reads 


(Ff — pi) • 8\\ + (F 2 — p 2 ) • 8fa = 0. 


(15.7) 



The length of the rope is constant (constraint): 

fa -\~fa = l- 
This leads to 

8 l\ = — 8 fa and 'fa — —fa. 

The inertial forces are 

pi = mih and P 2 = m 2 l 2 - 

By inserting this into (15.7) and taking into account that the accelerations are par¬ 
allel to the displacements, we have 

(m\g sin a — mi/i)5/i + (m 2 g sin /3 — m 2 fa) 8 fa = 0, 

( m\g sin a — m\l\ — m 2 g sin /3 — m 2 fa) 8 l\ = 0, 

or 

mi sin« — m 2 sin^ 

h = --- g. 

m 1 + m 2 


EXERCISE 


15.3 Equilibrium Condition of a Bascule Bridge 

Problem. Find by means of the d’Alembert principle the equilibrium condition for 

(a) a lever of length fa, with a mass m at a distance fa from the bearing point, and 
with a force F\ acting vertically upward at its end; and 

(b) the bascule bridge in Fig. 15.4, with the forces G and Q acting. 


Fig. 15.2. Two masses on an 
inclined plane connected by a 
rope 



Fig. 15.3. Lever with mass m 
and force F/ 
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Solution, (a) The d’Alembert principle yields 
-Sr v = 0. 

V 

We have 

Fi = F\t y , F 2 =-mge y 
and 

ri = /i cos cpe x + 1\ sin cpe y , 

l 2 

Y 2 = h cos cpe x + h sin cpe y = — n. 

h 

Furthermore, 

h 

8ri = (—l\ sirupe x +1\ coscpe y )8(p and 8r 2 = —r\. 

h 

This leads to 

2 

Y, F v • 8r v = (F\li cos cp — mgl 2 cos cp)8(p = 0, 

V = 1 

i.e., the equilibrium condition reads 

I 2 n 3 7T 

Fi = -mg for <p^ 

(b) The forces acting at the points 1 and 2 are 
F 2 = — Ge v , Fi = —Qe y . 

Furthermore, 

ri = —a cos cpe x + (d — a sin cp)e y 
and 

Y 2 = (b + c) cos cpe x + (b + c) sin cpe y ; 
i.e., 


<$ri = (a sin cpe x — a cos cpe y )8(p 
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and 

<Sr 2 = (—(£> + c) sin<pe x + (b + c) cos (pe y )8(p. 

The d’Alembert principle reads 

2 

0 = ^F v • 8r v = (Qa cos cp — G(b + c)coscp)8(p = [Qa — G(b + c)]coscp8cp. 

V=1 

The equilibrium condition 


is independent of the angle (p ! 


As is seen in Example 15.1 and Exercise 15.2, the drawback of the principle of vir¬ 
tual displacements is that one still must eliminate displacements that are dependent 
through constraints before one can find an equation of motion. We therefore introduce 
generalized coordinates qt. If we transform the 8rt in (15.6a) to 8qt, the coefficients 
of the 8qt can immediately be set to zero. 

Starting from (15.6a), we introduce in the first sum according to (14.6) and (14.7), 
the generalized forces 

" " f g r . f 

• Sri = • J2 V^ 8 ^ = £ Q^Sq a . (15.8) 

i =1 i—\ a=\ a=l 

We now consider the other term in (15.6a): 


£p i - Sri = 'Y^m i r i ■ <5r ; . 

i i 

If we express <5r, according to (14.6) by the <5c/,, we obtain 

Pi ■ Sr t = Vm,r, • -8q v . (15.9) 

. dq v 

l l,V 



By adding and simultaneously subtracting equal terms, we rewrite the right-hand side 
of the equation: 



Exercise 15.3 


(15.10) 
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To derive the expression for the kinetic energy, we change the order of differentiation 
with respect to t and q v in the last term of (15.10): 


d / 3r* 


dt \dq v ) dq v \dt 
Insertion in (15.10) yields 


— r * I = 7—v/ 

dq v 




9r; \ 3 

- — m;\; • - V; 

dq v ) dq v 


(15.11) 


(15.12) 


We can rewrite the expression 9r* /dq v in the first term of the right side of (15.12) by 
partially differentiating (14.4) with respect to q v : 

dVi _ dri 
dq v dq v ’ 

since (d/dq v )(dri/dt) = 0 and from the sum remains only the factor at q v . By insert¬ 
ing this relation into (15.12), we obtain 


Here, J]-(l/2)m/V? is the kinetic energy T : 


“V dq v ) dt \ dq v ) dq v 
Insertion into (15.9) leads to 


d ( dT 


dT 


(15.13) 


Using (15.8) and (15.13), we can express the d’Alembert principle by generalized 
coordinates. Insertion of 


F/ • 5r, = ^ Q v &q v (compare (15.8)) 


into (15.6a) yields 

t dT \ dT „ \o 

qv ~ 


(15.14) 


The q v are generalized coordinates; thus, the q v and the related 8q v are independent 
of each other. Therefore, (15.14) is satisfied only if the individual coefficients vanish, 
i.e., for any coordinate q v we must have 


d ( dT 


dT 


dt\dq v ) dq v Qv °’ V r...,/. 


(15.15) 
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As a further simplification, we assume that all forces Fj can be derived from a potential 
V (conservative force field): 


F( = —grad ( (V) = —V,(V). 

In this case, the generalized forces Q v can be written as 

«2 l) =y'F i - — = -y'v ( v 

4^ dq v ^ dq v dq v ' 


because 


E l dV dV uv \ i ojii uyi ozi 
( ^r.. ejc + Av.- ey + At.. * z ) ’ ( + A/i.. Cy + A/1-. 


ay 


dXj 




dz 


dxi 


dyi dzi 


dqv dqv 


dq v 


^/ay dxi ay d yi ay d Zi 

^\dxi dq v dyi dq v dzt dq v 


_ ay 
dq v ' 

By inserting Q v = —dV/dq v into (15.15), we obtain 

d / dT\ dT dV _ o 

dt \dq v J dq v dq v 


and 


d/dT\ ar-y_ o 

dt y dq^) J dq y 


y is independent of the generalized velocity; i.e., V is only a function of the position: 

— = 0 . 

dq v 

Therefore, we can write 


d a a 

(T-V)-—(T-V) = 0, 

a^y 

or, by defining a new function, the Lagrangian 2 

L = T -V, 
d dL dL 

-7-=0, V = 1, . . . , 

dt dq v dq v 


(15.16) 

(15.17) 

(15.18) 


2 Joseph Louis Lagrange, b. Jan. 25, 1736, Torino-d. April 10, 1813, Paris. Lagrange came from a 
French-Italian family and in 1755 became professor in Torino. In 1766, he went to Berlin as director 
of the mathematical-physical class of the academy. In 1786, after the death of Friedrich II, he went to 
Paris. There he essentially supported the reformation of the system of measures and was a professor 
at various universities. His very extensive work includes a new foundation of variational calculus 
(1760) and its application to dynamics, contributions to the three-body problem (1772), application 
of the theory of continued fractions to the solution of equations (1767), number-theoretical problems, 
and an unsuccessful reduction of infinitesimal calculus to algebra. With his Mecanique Analytique 
(1788), Lagrange became the founder of analytical mechanics. 
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These equations are called Lagrange equations , and the quantities dL/dq v are called 
generalized momenta. In Newton’s formulation of mechanics, the equations of mo¬ 
tions are established directly. The forces are thus put in the foreground; they must be 
specified for a given problem and inserted into the basic dynamic equations 

Pi = F|, i = l,...,N. 

In the Lagrangian formulation the Lagrangian is the central quantity, and L includes 
both the kinetic energy T and the potential energy V. The latter one implicitly in¬ 
volves the forces. After L is established, the Lagrange equations can be established 
and solved. Both methods are equivalent to each other, as can be seen by stepwise 
inversion of the steps leading from (15.6a) to (15.18). 


EXAMPLE 

15.4 Two Blocks Connected by a Bar 

Two blocks of equal mass that are connected by a rigid bar of length l move without 
friction along a given path (compare Fig. 15.5). The attraction of the earth acts along 
the negative y-axis. The generalized coordinate is the angle a (corresponding to the 
single degree of freedom of the system). 

Fig. 15.5. Two blocks are y 
connected by a bar 


For the relative distances x and y of the two blocks, we have 
x — l cos a, y = l sin a. 

The constraint is holonomic and scleronomic. We will determine the Lagrangian 
L = T -V. 

The kinetic energy of the system is 
T = Li{x 2 + y 2 ). 

For this purpose, we form x and y: 

x = — /(sino')o', y =/(cosQ')d. 

Thus, we get for T 

T = - m(l 2 (sin 2 a) a 2 + l 2 (cos 2 a) a 2 ) = -ml 2 a 2 . 
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For the potential, we have (conservative system), 
V = mgy = mgl sin a. 

The Lagrangian therefore reads 


1 


L = T — V = -ml 2 a 2 — mgl sin o'. 

2 

We insert L into the Lagrange equation (15.16): 


d dL dL d 9.2 

-:-= — (ml a ) + mgl cos a = 0 

dt da da dt 


and 

2- S 

ml amgl cos a = 0, a + ycosa' = 0. 

Multiplication by a yields 

g 

aa + —a cos a = 0. 

These equations can be integrated directly. One obtains 

-a 2 + — sin a = constant = c 
2 / 

or 


/ 2 ( c — — sin a ). 


Separation of the variables a and t leads to the equation 


dt = 


da 


t -1 0 = 


V2 (c — (g/l) sin a) ’ J V2(c — (g//)sina 

ao 


a 

/ 




The constants c and *o are determined from the given initial conditions. 

EXAMPLE 


15.5 Ignorable Coordinate 

We will use the following example for the Lagrangian formalism to explain the con¬ 
cept of the ignorable coordinate. The arrangement is shown in Fig. 15.6. 



Example 15.4 


Fig. 15.6. Two masses m and 
M are connected by a string 
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Example 15.5 Two masses m and M are connected by a string of constant total length l = r + s. 

The string mass is negligibly small compared to m + M. The mass m can rotate with 
the string (with varying partial length r) on the plane. The string leads from m through 
a hole in the plane to M, where the mass M hangs from the tightly stretched string 
(with the also variable partial length s = l — r). Depending on the values co of the 
rotation of m on the plane, the arrangement can glide upward or downward. Thus, the 
mass M moves only along the z-axis. The constraints characterizing the system are 
holonomic and scleronomic. This arrangement has two degrees of freedom. The two 
corresponding generalized coordinates <p and s uniquely describe the state of motion 
of this conservative system. 

We have 

x — r cos (p = (l — s) cos (p, 
y = r sirup = (l — s) sirup. 

For the kinetic energy T of the system, we obtain 

1 (d \ 2 1 7 . 7 1 

T = —wi ( —(/ ~ S )J + ~(l ~ s) 2 rmp 2 +-Ms 2 

= —(m + M)s 2 + -(/ — s) 2 m<p 2 . 

The potential V reads 


V = -Mgs. 

For the Lagrangian L, we get 

L = T — V = - (m + M)s 2 + -(/ — s) 2 m(p 2 + Mgs. 
We now form 


d dL 
dt ds 
d dL 
dt dcp 


(m + M)s, 

d 2 

-((l-s) m<p). 


dL , 

— = —(/ — s)m(p + Mg , 
3 s 

^= 0 . 

dip 


Because 3L I dip = 0, ip is called an ignorable or cyclic coordinate. The Lagrange 
equation for ip then reduces to 


d dL 
dt dip 


L((l - s) 2 m(p) = 0 
dt 


or 


(l — s) 2 (pm = L = constant. 

Here, L is the angular momentum of the rotating mass m. 

This first integral of motion is the angular momentum conservation law. Generally 
speaking, the Lagrangian equation of motion 


d dL dL 
dt dqj dqj 
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for an ignorable (cyclic) variable reduces to 


d dL 
dt dqj 


= 0 or 



Here, pj = dL/dqj is the generalized momentum. The generalized momentum re¬ 
lated to the cyclic coordinate is thus constant in time. Therefore, the general conserva¬ 
tion law holds: The generalized momentum related to a cyclic coordinate is conserved. 
The Lagrange equation for s reads 


(m + M)s + (/ — s)mcp 2 — Mg = 0 


or, after multiplication by i, 


(m + M)ss + 


L 2 s 

(/ — s) 3 m 


Mgs = 0, with L = (/ — s) 2 m<p. 


The last equation can be integrated immediately, and we obtain as a second integral 
of motion 


1 

2 


(m + M)s 2 + 


L 2 

2(1 — s) 2 m 


Mgs = constant = T + V = E; 


i.e., the total energy of the system is conserved. The given system is in a state of equi¬ 
librium (gravitation force = centrifugal force) for vanishing acceleration, d 2 s/dt = 0: 


1 


m + M 
1 


m + M 


Mg — (l — s)m 


Mg- 


(l — s) 2 m 


(l — s) 2 m J 


The result states that s must be constant. For a fixed distance so, equilibrium therefore 
appears for a definite angular momentum L = Lo, which corresponds to a definite 
angular velocity co = (p: 


L 0 = JMmg(l-so) 3 . 


For L > Lo, the entire arrangement glides upward; for L < Lo, the string with the 
two masses m and M glides downward. For L = Lo, the system is in an equilibrium 
state. For the special case L = 0 (i.e., (p = 0, no rotation on the plane), one simply has 
the retarded free fall of the mass M. 


Example 15.5 


EXAMPLE 


15.6 Sphere in a Rotating Tube 

As a further example of the Lagrangian formalism, we discuss a problem with 
a holonomic rheonomic constraint. A sphere moves in a tube that rotates in the 
x, y-plane about the z-axis with constant angular velocity co. 
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Fig. 15.7. Sphere in a rotating 
tube 



side view 



This arrangement has one degree of freedom. Accordingly, we need only one gen¬ 
eralized coordinate for a complete description of the state of motion of the system: the 
radial distance r of the sphere from the rotation center. 

One has 

x = r coscot, 
y = r sin cot. 

The Lagrangian L — T — V then reads 

L = -m(x 2 + y 2 ) = -m(r 2 + co 2 r 2 ), 

if we take into account that for this arrangement the potential V = 0. 

We now form 

d 3 L .. 3 L 2 

-r = mr, — = mco r. 

dt 3r 3r 

Then we obtain the Lagrange equation 
mr — mco 2 r = 0, 


or 


r — co 2 r = 0. 

This differential equation corresponds—up to the minus sign—to the equation for the 
nondamped harmonic oscillator. It has a general solution of the type 

r(t) = Ae^ + Be"^. 

With increasing time t, this expression for r(t) also increases; i.e., 
lim r(t ) = oo for A > 0. 

t—^oo 

From the physical point of view, this means that the sphere is hurled outward by the 
centrifugal force that results from the rotation of the arrangement. 

The energy of the sphere increases. The reason is that the constraint reaction per¬ 
forms work on the sphere. Although the constraint force is perpendicular to the tube 
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wall, it is not perpendicular to the trajectory of the sphere. Hence, the product F z • 8s 
does not vanish. 


EXERCISE 


15.7 Upright Pendulum 


Problem. Determine the Lagrangian and the equation of motion of the following 
system: Let m be a point mass on a massless bar of length / which in turn is fixed to 
a hinge. The hinge oscillates in the vertical direction according to h(t) = ho cos cot. 
The only degree of freedom is the angle ft between the bar and the vertical (upright 
pendulum). 



Solution. The position of the point mass m is (x, y): 

x = l sin ft, y = h(t) + 1 cos ft = ho cos cot + l cos ft. 
Differentiation of this equation yields 

x = ftl cos ft, y = — (coho sin cot + ft l sin#). 

Hence, the kinetic energy T becomes 

T = im(i 2 + y 2 ) 

= -m(# 2 / 2 + co 2 h^ sin 2 ®? + Icoh^l sin?? sin®?), 
and the potential energy reads 

V = mgy = mg(ho coscot + l cos ft). 

Then the Lagrangian becomes 


Exercise 15.6 


Fig. 15.8. A mass m is fixed to 
one end of a bar; the other end 
of the bar is fixed to an oscil¬ 
lating hinge 


L — T — V 

— — [ft 2 1 2 + co 2 h^ sin 2 cot + Icohoft sin# sm cot — 2g (ho cos cot + /cos#)]. 
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Exercise 15.7 


The Lagrange equation reads 

d /3L\ 3L_ q 

dt \dft) dft ~ ’ 

dL 2 . 

— - = ml ft -ft mcohnl sin # sin cot, 

3ft 

dL 

— = m coho ft l cos ft sin cot + mgl sin ft, 
dft 

d dL 9” • ? 

—-- — ml ft ft- mcoholft cos ft sin cot + mco hoi sin ft coscot, 

dt dft 

l 2 ft -ft cohol ft cos ft sin cot -ft co 2 hoi sin ft coscot — cohol ft cos ft sin cot — gl sin# = 0, 


or 

l ft + co 2 ho sin ft cos cot — g sin ft = 0. 

The substitution ft' = ft — n =>- sin# = — sin#'; for small displacements, — sin ft' ~ 
—ft', i.e., 

lift' -ft (g — co 2 ho cos cot) ft' = 0. 

This is the desired equation of motion. If the piston is at rest, i.e., h(t) = ho = 0, we 
get 

£' + !#' = (). 

This is the equation of motion of the ordinary pendulum! 


EXERCISE 


15.8 Stable Equilibrium Position of an Upright Pendulum 

Problem. Find the position of stable equilibrium of the pendulum of Exercise 15.7 
if the hinge oscillates with the frequency co V g/l . 

Solution. We first rewrite the Lagrangian of the pendulum of Exercise 15.7 as fol¬ 
lows: The terms 


mco 


#q sin 2 cot and — mgho cos cot 


can be written as total differentials with respect to time: 


mco 


-hn sin 2 cot = — I —mcohi sin cot cos cot \ -ft C, 
2 u dt V 4 u 1 

d ( mgho . , . 

—mgho cos cot = — I-sin cot ). 

dt \ co 


d ( 1 
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We can omit these terms, since Lagrangians that differ only by a total derivative with 
respect to time, according to the Hamilton principle 8 Ldt = 0, are equivalent. 
Hence, 


m . 


Ill r • O O 7 7 7 * "I 

L = — / + coh 0 sin cot-\-2coho ftl sin ft sin cot — 2g(ho cos cot-\-1 cos ft)\ 


= — [ft 2 1 2 + 2cohoftl sin ft sin cot — 2 gl coscot]. 


(15.19) 


Another transformation yields 


mcohoft l sin ft sin cot =- (mcohol cos ft sin cot) +mco 2 hol cos ft coscot, 

dt 

so that the Lagrangian finally reads 


L = — [ft 2 l 2 + 2co z hol cos ft coscot — 2glcosft]. (15.20) 

From this, one obtains of course the equation of motion as in Exercise 15.7. 

We consider ft as a generalized coordinate with the appropriate mass coefficient 
ml 2 . The equation of motion then reads 


ml 2 ft — mgl sin ft — mco 2 hol sin ft cos cot 
du 

= ~M + f 


(15.21) 


with u = mgl cos ft and / = —mco 2 hol sin ft coscot. The additional force / is due to 
the motion of the hinge. For very fast oscillations of the hinge, we assume that the 
motion of the pendulum in the potential u is superposed by quick oscillations §: 




The average value of the oscillations over a period 2n/co equals zero, while ft changes 
only slowly; therefore, 



In/co 

J ft(t)dt = ft(t). 

o 


(15.22) 


Equations (15.21) with (15.22) can then be written as 

ml 2 §(t) + ml 2 '£(t) = + /(#). 

dv 

Because f(ft) = f(ft + §) = f(ft) + t-df/dft, an expansion up to first order in § 
yields 


, dU d 2 U ~ df 


The dominant terms for the oscillations are m/ 2 | and f(ft): 


ml 2 'i = f(d) 


=> f = ' 


co 2 ho . 


I 


sin ft coscot, 


(15.23) 


Exercise 15.8 





284 


15 D'Alembert Principle and Derivation of the Lagrange Equations 


Exercise 15.8 


and from this, we obtain 




ho . ~ 

— sm d cos cot — 


f 

mco 2 l 2 


(15.24) 


We now calculate an effective potential created by the oscillations, and for this purpose 
we average (15.23) over a period 2tt/co (the mean values over § and / vanish): 


2 ~ dU df dU 1 df 

ml d =-~ +§ — =-txt- — f — . 

dd dd dd mco 2 l 2 dd 

This can be written as 


ml 2 d = — 


dUeB 

dd 


with 


f/eff=f/ + 


1 

2m co 2 — l 2 


f 2 - 


(15.25) 


Because cos 2 cot = 1/2, we get 
mco 2 hi 0 

t/ eff = U + —sin 2 d 


mco 2 hi 2 

■ mgl cos d H-sin d. 


(15.26) 


The minima of t/ e ff gi ye the stable equilibrium positions: 

dU e & moo 2 hi i 

-= —mgl sin d H-- sin d cos d = 0 

dd 4 

2 gl 

=4> sin^ = 0 or cos d = (15.27) 

co 2 h q 

From this, it follows that for any the position vertically downwards (d = n) is 
stable, d = 0 is excluded because U e ff(d = 0) = mgl. Additional stable equilibrium 
positions arise for co 2 > 2gl / with the angle given above. 


EXERCISE 


15.9 Vibration Frequencies of a Three-Atom Symmetric Molecule 


3/2/1 

•-o-• 

ABA 
Fig. 15.9. Linear three-atom 
symmetric molecule 


Problem. Find the vibration frequencies of a linear three-atom symmetric molecule 
ABA. We assume that the potential energy of the molecule depends only on the dis¬ 
tances AB and BA and the angle ABA. Write the Lagrangian of the molecule in 
appropriate coordinates (normal coordinates) where the Lagrangian has the form 


L = ^^(©2_ ft ,2 @ 2 ) 


The co a are the desired vibration frequencies of the normal modes. If one cannot 
find the normal coordinates of the system, one can proceed as follows: If a system has 
s degrees of freedom and does not vibrate, then the Lagrangian generally reads 

L = \ yy m ik XiX k - k ik XiX k ). 
i,k 
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The eigenfrequencies of the system are then determined by the so-called characteristic Exercise 15.8 

equation 


det \kik - w 2 m ik \ = 0 . 

Solution. We describe the geometry of the molecule in the x,y-plane. Let the dis¬ 
placement of the atom a from the rest position r a o be denoted by x a = (x a ,y a ), i.e., 
r a = r a o + • The forces that keep the atoms together are assumed to be to first order 

linear in the displacement from the rest position, i.e., 

L = ^(i?+i 3 2 ) + ^i 2 2 - El [(XI -x 2 ) 2 + (*3 -x 2 ) 2 ], 

if we consider longitudinal vibrations. For these modes the conservation of the center 
of gravity can be written as follows: 


m A {x\ + X 3 )m B X 2 = 0 , 


^ \ rrigYa — ^ ^ rrigYa p , 
a a 


and we can eliminate X 2 from L: 


m A 


m, 


L — — (iq +i 3 ) + 

2 2m B 


(ii +X 3 ) 


Kl 

2 


x\ + x 3 + 2— —(x\ +X3) 2 H - ^-(vi +X3) 2 


m B 


l B 


Hence, only two normal coordinates for the longitudinal motion can exist, because of 
the conservation of the center of gravity. 

Let ©1 = x\ + X 3 , ©2 = x\ — X 3 . L can then be written as 


L = 


^©2 + 

4 z 


m A /r . 2 



K l /x 

4m 2 B 




Ii = 2 m A +m B , 


i.e., ©1 and ©2 are the two normal coordinates of the longitudinal vibration (/x repre¬ 
sents the total mass of the molecule). 

(a) For x\ = X 3 , ©2 vanishes; i.e., ©1 describes antisymmetric longitudinal vibrations 
(Fig. 15.10). 

(b) For x\ = — JC 3 , ©1 vanishes; i.e., ©2 describes symmetric longitudinal vibrations 
(Fig. 15.11). 


Fig. 15.10. 


-O 


-o 


Fig. 15.11. 


A comparison of kinetic and potential energy yields the normal frequencies 


o>a 


[KTiT 

V m A m B ’ 


co s 



antisymmetric vibration, 
symmetric vibration. 
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A 


A 


For transverse vibrations (see Fig. 15.12) of the form in Fig. 15.13, we set 


o 



B 


L = 


Fig. 15.12. 


Fig. 15.13. 



where 8 is the deviation of the angle <) (ABA) from n. For small values of 8, we can 
set 



= cosaq + cos ot 2 

_yi-y\ yi-y'i 

i + i ’ 

We utilize the conservation of the center of gravity and angular momentum conserva¬ 
tion to eliminate yi and y 3 from L. 

niA(y l + J 3 ) + wiBy2 = 0 (conservation of the center of gravity). 

To exclude rotation of the molecule, the total angular momentum must vanish: 

D = x v«] ~ ^m a [r a0 x x a ] = ^ y^m a [r a0 x x„], 


a 


a 


a 


which can be achieved by 



a 


For our case, it thus follows that y\ — y^. Then we get 



and 



We thus obtain the eigenfrequency of the transverse vibration: 



EXERCISE 


15.10 Normal Frequencies of a Triangular Molecule 


Problem. Calculate the normal frequencies of a symmetric molecule ABA of trian¬ 
gular shape: 
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y 3 

A 



Fig. 15.14. Triangular mole¬ 
cule 


Solution. Conservation of the center of gravity here reads 

m A (x\ + x 3 ) + m B x 2 = 0, m A (y\ + y 3 ) + m B y 2 = 0. 

For angular momentum conservation, we go to the rest position of atom B , and be¬ 
cause m\ = m 3 = m A , it follows that 

rio x xi + r 30 x x 3 = 0. 

We have 

no x xi = |rio|(— x\ cosa + y\ sinoOe, 
r 30 xx 3 = |r 30 |(— x 3 cos a — y 3 sinaje. 

Because |rio| = |r 3 o|, the angular momentum conservation law follows: 

sina(yi — y 3 ) = cosa(vi + x 3 ) or y\ — y 3 = cota(xi + jc 3 ). 


Fig. 15.15. The various coor¬ 
dinates for Exercise 15.10 



The changes 81 \ and 81 2 of the distances AB and BA result by projection of the 
vectors xi — X 2 and x 3 — X 2 onto the directions of the lines AB and B A: 

81 1 = (x\ — X 2 ) sin a + (yi — 3 ^ 2 ) cos a, 

8h = ~(x 3 ~ X 2 ) sin a + (y 3 - ^ 2 ) cos a. 

The change of the angle 2a =<) (ABA) is found by projection of the vectors x\ — X 2 
and x 3 — X 2 onto the directions orthogonal to the line segments AB and BA: 

l r n l r 1 

8 = j[( x i - xi) cos a - (yi - y 2 ) sinaj + y[—(x 3 ~ x i) cos a - (y 3 - y 2 )sinaj. 
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Exercise 15.10 


Fig. 15.16. 


We write the Lagrangian of the molecule as 


L 


m A 
2 


(±i+± 2 3 ) + '^- 


m B 


^[(Sh) 2 + (8l 2 ) 2 ] 


K 2 

2 


(IS) 2 . 


Here, (K\/2)[(8l \) 2 + ( 8 I 2 ) 2 ] is the potential energy of the rotation, and K 2 Q 8 ) 2 /2 is 
the potential energy of the bending of the molecule. We adopt as new coordinates 


Qol =*i+x 3 , q s i=x\-X 3 , q S 2 = yi+y 3 , 

and then have 


1 

m A _ 

1 


x\ = -( Qa+q S \), 

X2 = - Qa, 

X3 = ~(Qa 


2 

m B 

2 


1 

m A 

1 


yi = 2^2 + Qa cot a), 

yi = - q s 2 , 

m B 

J3 = ~(q s 2 

- Q a cota) 


Because yi — y 3 = Q a cot a, we find for L 


_ m A ( 2 m A 1 \ A 2 . m A . 2 . m Al 1 .2 

L - ~T ( “-*" • 2 ) Qol + ~r^s\ + ~7~ Qs 2 

4 \ m b sin a / 4 4 m B 


2 Ki ( 2 ma 1 

CM A I I 


4 V m B sin 2 ( 


, , 2 m A . 2 

1 H-sin a 

m B 


q 2 n 2 

-— ( K\ sin 2 a + 2 K 2 cos 2 a) — q 2 2 —y- ( K\ cos 2 a + 2 K 2 sin 2 a) 

4 4 m z B 

/jl 

+ q s iq s 2 - - (2K 2 - K\) sinacosa. 

2 m B 


Obviously, Q a is a normal coordinate, with the vibration frequency 


Ki_ 

m A 


< = —[ 1 + 


2 m A 
m B 


sin 2 a 


Pure Q a -vibrations occur for x\ = X 3 ,yi = — 373 ; i.e., Q a describes antisymmetric 
vibrations with respect to the y-axis in Fig. 15.16. 



The eigenfrequencies co s \ and co s 2 of the normal vibrations for q s \ and q s 2 must be 
determined by the characteristic equation 


co 4 — co 2 


K 1 ( 2 m A 2 \ , 2K2 (1 , 2m A 

— H-cos z a H- 1 H- 

m A \ m B J m A \ m B 


sin 2 a 


2fiKiK 2 _ 

' 2 

m B m\ 
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The coordinates q s \ and q s 2 correspond to vibrations that are symmetric about the 
y-axis (Fig. 15.17): 


Exercise 15.10 


( X \ = -X3, Qa= 0 => yi=y3)- 



Fig. 15.17. 


EXERCISE 


15.11 Normal Frequencies of an Asymmetric Linear Molecule 

Problem. Find the normal frequencies for a linear, asymmetric molecule with the 3 z 2 

shape in Fig. 15.18. £ 

Solution. Conservation of the center of gravity and of angular momentum now read Fig * 15 * 
m\x 1 + m bx 2 + mcx 3 = 0 , v-center of gravity, 
rrixyi +^ 53^2 + rncyz = 0 , y-center of gravity, 
mAhyi = mchyi, angular momentum conservation. 

For the potential energy of bending, we write 


V = ^(lS) 2 , (2l = h+h); 


for that of rotation, 



The analogous calculation as for Exercise 15.9 after some effort yields 



for the frequency of the transverse vibration, and also the equation quadratic in co 2 



for the frequencies col 1 , col 2 of the two longitudinal vibrations. 


EXERCISE 


15.12 Double Pendulum 


Problem. Determine 

(a) the generalized coordinates of the double pendulum; 

(b) the Lagrangian of the system; 


<N Q Dq 
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Exercise 15.12 


Fig. 15.19. Coordinates of the 
double pendulum 


(c) the equations of motion; 

(d) for m\ = m 2 — m and l\ — h — l\ 

(e) as (d) for small amplitudes; and 

(f) for the case (e) the normal vibrations and frequencies. 



Solution, (a) The appropriate generalized coordinates are the two angles and #2 
that are related to the Cartesian coordinates by 

x\ = l\ cos , yi=/isin#i, 

(15.28) 

X 2 = l\ cos #1 + h cos # 2 , yi = h sin #1 + h sin # 2 - 

(b) From (15.28), it follows by differentiation that 

x\ = —1\&\ sin#i, y\ = l\&\ cos # 1 , 

X 2 = —ll&l sin#i — /2^2Sin?^2, yi = h&\ COS#i + /2#2COS#2- 

The kinetic energy of the system is 

T = h 1 (x^ + yf) + + 

= -mil'l'&i + +1 \&I + 2/i/2??ij?2 cos(j?i — &2 ))• 

(Addition theorem!) 

To get the potential energy, we adopt a plane as a reference height, at the distance 
1 1 + h below the suspension point: 

V = m\g[l\ +h~h cos#i] + m 2 g[h -Eh — (h cos#i + h cos # 2 )]- 
The Lagrangian then becomes 

L = T -V 

= + ^rri 2 [li&i -El\&\ + 2 /i/2^i^2 cos(#i # 2 )] 

— m\g[l\ -Eh — h costfi] — rri 2 g\l\ -Eh~ Qi cos#i + h cos # 2 )]- (15.29) 

(c) The Lagrange equations with & 1 and #2 read 
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One has 

3L . . 

—— = sin(?^i # 2 ) m\gh sin#i m^gh sin#i, 

ou i 

3L 9 . 9 . 

—+ m2hh^2 cos(#i — # 2 ), 

3?^i 

3L . . 

—— = m 2 /i/ 2 ^l^ 2 sin(^i - # 2 ) -m 2 gh sin # 2 , 
du 2 

dL 9 . 

—— = m2/ 2 ^2 + m2hhfti cos(#i — 7 ^ 2 ). 

3 #2 

Thus, the Lagrange equations read 

+ 7772/1 #1 + m 2 hl 2&2 cos(t^i — 7 ^ 2 ) — 7772 / 1 / 2 ^ 2(^1 — # 2 ) sin(#i — 7 ^ 2 ) 

= — 7772 / 1 / 2 ^ 1^2 sin(#i — 7 ^ 2 ) — m\gl\ sin?^i — 7772g/i sin 7^1 

and 

7772/2^2 + 7772/1/2^1 cos(t^i - #2) - 7772/1/2^1 (#1 - #2) sin(^i - #2) 

= 7772/1/2^1^2 sin(#i - 7^2) - 7772 g/ 2 sin# 2 , 

or 

(7771 T 7772)/i 7^1 + 7772/l/2^2 COS(?^l 7^2) + 7772/l/27^2 sin (^l 7^2) 

= —(7771 + 7772 )g/i sin 7^1 (15.30) 

and 

7772 /f #2 + 7772 / 1 / 2^1 cos(#i - # 2 ) 7772 / 1 / 2 #! sin(?^i - 7 ^ 2 ) 

= —m 2 gh sin # 2 . 

These are the desired equations of motion. 

(d) For the case 

7771=7772 = 777 and /j = I2 = l, 

(15.30) reduce to 

2/^i +/t^ 2 Cos(?^i — 7 ^ 2 ) + /#f sin(?Hi — 7 ^ 2 ) = —2gsin#i, 

.. (15.31) 

/#i cos(?^i — 7^2) + /#2 — /t^j sin( 7 ^i — 7^2) = —g sin 7^2. 

(e) If moreover the oscillations are small, then sin 7 ^ = 7 ^, cos 7 ^ = 1, and terms 
proportional to ft 2 are negligible, which leads to 

2/^1 +/ 7^2 = — 2 g # 1 , /&1 + /#2 = — g # 2 - 

(f) With the ansatz 
^ l= AiQ icot , $2 = A 2 e i0)t , 


Exercise 15.12 


(15.32) 
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Exercise 15.12 


Fig. 15.20. Mass point glides 
on a cycloid 


we then obtain 


2 (g — l co 2 ) A i — lco 2 A2 = 0, 


-loo 2 A\ + (g — lco 2 )A 2 = 0. 


(15.33) 


To ensure that A\ and A 2 do not vanish simultaneously, the determinant of the coeffi¬ 
cients must vanish: 


2 (g - la?) -la? 

—la? g — la? 


and therefore, 

l 2 a? - Alga? + 2g 2 = 0 
with the solutions 


0 , 


2 4(g±v/l 6 l¥-»¥ , r< s 

" =-al-= ( 2 ± V2) t 


i.e., 


(W[ = (2 + V2) y , 


= (2- V2)| 


(15.34) 


By inserting (15.34) into (15.33), we obtain 

co 2 : A 2 — — V2A \, i.e., the pendulums oscillate out of phase, 


co 


2 . 


: A 2 = a/2 A], i.e., the pendulums oscillate in phase. 


EXERCISE 


15.13 Mass Point on a Cycloid Trajectory 


Problem. A mass point glides without friction on a cycloid, which is given by x = 
a(ft — sin ft) and y = a( 14- cos ft) (with 0 < ft < 2 tv). Determine 

(a) the Lagrangian, and 

(b) the equation of motion. 

(c) Solve the equation of motion. 




Solution. The cycloid is represented by 
x=a(ft — sin#), y = a{\ + cos ft), 
















15.1 Virtual Displacements 


293 


where 0 < # < 2tt . The kinetic energy is 

T = -m(i 2 + y 2 ) = -m 2 {[( 1 — cos#)#] 2 + [—(sin ^^ ] 2 }, 
and the potential energy is 


V = mgy = mga( 1 + cos #). 
The Lagrangian is given by 

L = T — V = ma 2 ( 1 - cos#)?? 2 


mga( 1 + cos ft). 


The equation of motion then reads 

d /dL\ 9L_ o 
dt \dft) dft ~ ’ 


— [2ma 2 (l — cos ft) ft] — [ma 2 ( sin ft) ft 2 + mga sin ft] = 0 
or 

— [(1 — cos#)#] — -(sin#)# 2 —— sin# = 0, 

dt 2 2 a 

i.e., 

••1 . 9 g 

(1 — cos#)?? + - (sin#)# 2 -sin# = 0. 

2 2a 


By setting u = cos(#/2), one has 



Since cot(#/2) = sin#/(l — cos#), we can write (15.35) as 


.. 1 

# + - cot 
2 


# 

2 



= 0 , 


and therefore, 
d 2 u g 

77 + —u — 0 . 

dt 2 4 a 

The solution of this differential equation is 


(15.35) 


(15.36) 


Exercise 15.13 
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The motion is just like the vibration of an ordinary pendulum of length l = 4a. The 
arrangement is therefore called a “cycloid pendulum.” 


EXERCISE 



15.14 String Pendulum 


Problem. A mass m is suspended by a spring with spring constant k in the grav¬ 
itational field. Besides the longitudinal spring vibration, the spring performs a plane 
pendulum motion (Fig. 15.21). Find the Lagrangian, derive the equations of motion, 
and discuss the resulting terms. 

Solution. We introduce plane polar coordinates for solving the problem and adopt 
the radius r and the polar angle ip as generalized coordinates. 

y = r cos ip, y = r cosip + rip sirup, 

(15.37) 

x = r sirup, x = r sirup — npcosip. 


The kinetic energy is given by 

T = -m(x 2 + y 2 ) = -m(r 2 + r 2 tp 2 ). (15.38) 

The length of the spring in its rest position, i.e., without the displacement caused by 
the mass m, is denoted by ro. The potential energy then reads 

k 9 & 9 

V = — mgy + - (r — ro) = —mgr cos ip + -(r — ro) . (15.39) 

The Lagrangian is then 

L = T — V = -m(r 2 + r 2 (p 2 ) + mgr cos <p — -(r — ro) 2 . (15.40) 

The equations of motion of the system are obtained immediately via the Lagrange 
equations: 

d 2 . 

— (mr (p) =—mgr sirup. (15.41) 

dt 

This is just the angular momentum law with reference to the coordinate origin. If we 
take the time dependence of r into account, then we have 

mrip = —mg sirup — Imrip. (15.42) 


The last term on the right-hand side is the Coriolis force caused by the time variation 
of the pendulum length r. 

For the coordinate r, one obtains 


mr = mnp 2 + mg cos ip — k(r — ro). 


(15.43) 
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The first term on the right side represents the radial acceleration, the second term Exercise 15.14 

follows from the radial component of the weight force, and the last term represents 
Hooke’s law. For small amplitudes <p the motion appears as a superposition of har¬ 
monic vibrations in the r, (p- plane. 


EXERCISE 


15.15 Coupled Mass Points on a Circle 


Problem. Four mass points of mass m move on a circle of radius R . Each mass point 
is coupled to its two neighboring points by a spring with spring constant k (Fig. 15.22). 
Find the Lagrangian of the system, and derive the equations of motion of the system. 
Calculate the eigenfrequencies of the system, and discuss the related eigenvibrations. 

Solution. The kinetic energy of the system is given by 

l 4 

T = -mY J sl (15.44) 

V=1 


m 



Fig. 15.22. 


For small displacements from the equilibrium position, the potential reads 


l 4 

V = -k^2(s v +i -s v ) 2 , 


s 4+1 =s\. 


v=l 


(15.45) 


We set s v = R(p v , and take the angles (p v as generalized coordinates. Then the La¬ 
grangian is 


L = T - V = * l -mR 2 ~ \ kRl l>v+i - Vvf. 

V — \ V—\ 

From the Lagrange equations 
d dL _ dL 

dt d <j)y d(py ’ 

we find the equations of motion: 


d dL 
dt 3 cp v 


= mRcpy 


=- l -kR 2 [2{cp v 

_ dL 

3 <Pv' 


(p v+ \) + 2 (cp v 


<Pv- l)l 


(15.46) 


(15.47) 


(15.48) 
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Fig. 15.23. Uniform rotation 
and out-of-phase vibration 


For the case of four mass points, we then obtain 
k 

<Pi = —(<P2~2 <Pi + <pO, 
m 

k 

<P2 = ~(<P3 ~2<P2 + <Pl), 
m 

k 

<P3 = —(<P4 -2(^3 +<f>2), 
m 

k 

(P4 = —(<pi - 2^4 + 993 ). 
m 


(15.49) 


With the ansatz (p v = A v cos cot, (p v = —A v go 2 cos cot, we are led to the following linear 
system of equations: 


k 2 
- GO 

k 

0 

-- \ 


m 

m 


m 


k 

m 

i k 2 

m 

k 

m 

0 


(AA 

a 2 

0 

k 

m 

i k 2 

m 

k 

m 


a 3 

uu 

k 

m 

0 

k 

m 

O k 2 

2- GO 

m / 



(15.50) 


For the nontrivial solutions, the determinant of the coefficient matrix must vanish. This 
condition leads to the determining equation for the eigenfrequencies: 




(-« 2 )= 0 . 


The frequencies are 

co? = 0, go? = 4 — , 
m 



(15.51) 


(15.52) 


To calculate the related eigenvibrations, we insert these frequencies into the system of 
equations (15.50). 

( 1 ) coj = 0: A\ = A 2 = A 3 = A 4 . The system does not vibrate but performs a uniform 
rotation (Fig. 15.23(a)). 

(2) oo\ — 4k/m: A\ = A 3 = —A 2 — —A 4 . Two neighboring mass points perform an 
out-of-phase vibration (Fig. 15.23(b)). 

1 1 
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(c) co2 = co^ = 2 k/m\ A\ = A 2 = —A 3 = — A 4 or A\ = A 4 = — A 2 = —A 3 . Two 
neighboring mass points vibrate in phase (Fig. 15.24(a,b)). 


Exercise 15.15 


1 


1 


Fig. 15.24. In-phase vibration 


4 


4 



2 


(a) 


3 


(b) 


3 



EXERCISE 

15.16 Lagrangian of the Asymmetric Top 

Problem. Write down the Lagrangian of the heavy asymmetric top. Use the Euler 
angles as generalized coordinates and determine the related generalized momenta. 
Which coordinate is cyclic? Which further cyclic coordinate appears for a symmetric 
top? 

Solution. In the system of principal axes, the kinetic energy of motion is given by 



The potential energy is 
V = mgh = mgl cos p. 

We take the Euler angles (a, P, y) as generalized coordinates. The angular velocities 
expressed by these coordinates read (see (13.43)), 


co\ = a sin P sin y + /3 cos y, 
co 2 = oi sin /3 cos y — p sin y, 
C 03 = a cos P + y . 


By inserting this into the Lagrangian L = T — V, we get 
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The Euler angles as generalized coordinates obey the Euler-Lagrange equations 

d BL _3L 
dt da da 

and the analogous equations for P and y. The Lagrangian does not depend on the 
angle a; hence, this coordinate is cyclic, and the related generalized momentum is 
conserved. 

We determine the generalized momenta: 


Pa = 


dL dco\ dco 2 

— = + ©2<^2^" + ©3&>3 

da da da 

©l (a sin P sin y + P cos y) sin ft sin y 


dco 3 
da 


+ © 2 (a sin P cos y — P sin y) sin P cos y 


+ ©3 (a cos p + y ) cos p 

= asin 2 /J(©i sin 2 y + © 2 cos 2 y) + (©1 — © 2 )/? cosy sin/3 sin y 
+ ©3 (a cos P + y) cos p. 


In an analogous way, we obtain 

dL 9 9 

Pfj =— r = @iP cos y + O 2 P sin y 
dp 

+ (©1 — © 2 )dcosy sin/3 sin y + © 3 (d cos+ y), 
dL 

p y = — = © 3 (acos/3 + y). 
dy 

For the symmetric top, ©1 = © 2 , and thus, the Lagrangian simplifies considerably: 

L = ^©i(dsin 2 P + p 2 ) + ^© 3 {a cos P + y ) 2 — Mgl cos p. 

The Lagrangian of the symmetric top no longer depends on the angle y; therefore, the 
angle y becomes cyclic too. Hence, the momentum p y is also conserved. 

The generalized momenta then read 

p a = a sin 2 P® 1 + © 3 ( 0 ; cos P + y) cos P = constant, 
pp = P® 1 + © 3 (dcosy 0 + y), 
p y = © 3 (d cos P + y) = constant. 

The generalized momenta, being the projection of the total angular momentum onto 
the rotational axis related to the particular Euler angle, have a direct physical meaning. 

Pa is the projection of the total angular momentum onto the space-fixed z-axis (see 
Exercise 13.12): 


Pa — L • — L • e z . 

This projection is a conserved quantity for the asymmetric and the symmetric top. 
Since the gravitational force acts only along the z-direction, the angular momentum 
about this axis remains unchanged. 



15.1 Virtual Displacements 


299 


pp is the projection of the total angular momentum onto the nodal line, i.e., the 
axis about which the second Euler rotation is being performed: 

Pp = L • ep = L • e*. 

This momentum is not conserved. 

p y can be interpreted as the projection of the total angular momentum onto the 
body-fixed e z /-axis: 

p y = L • e K = L • e z /. 

For a symmetric top, the body-fixed z'-axis is a symmetry axis, and the angular mo- 
mentum projection L • e z > is conserved. 


Exercise 15.16 



For systems with holonomic constraints, the dependent coordinates can be eliminated 
by introducing generalized coordinates. If the constraints are nonholonomic, this ap¬ 
proach does not work. There is no general method for treating nonholonomic prob¬ 
lems. Only for those special nonholonomic constraints that can be given in differential 
form can one eliminate the dependent equations by the method of Lagrange multipli¬ 
ers. We therefore consider a system with constraints given in the form 


n 



(16.1) 


V=1 


(v = 1, 2 ,,n = number of coordinates; n > s; l = 1,2,..., s = number of con¬ 
straints). 

The following considerations do not depend on whether the equations (16.1) are 
integrable or not; i.e., they hold both for holonomic as well as for nonholonomic con¬ 
straints. 

Therefore, the method of Lagrange multipliers derived below can be used also for 
holonomic constraints, if it is inconvenient to reduce all q v to independent coordinates 
or if one wants to keep the constraint reactions. Equation (16. 1) is not the most general 
type of a nonholonomic constraint, e.g., constraints in the form of inequalities are not 
covered. 

In our considerations, we start again—as in deriving the Lagrange equations—from 
the d’Alembert principle. According to (15.13), it reads in generalized coordinates as 
follows: 



(16.2) 


This equation holds for constraints of any kind. 

The q v shall now depend on each other. Therefore, the virtual displacements 8 q v 
cannot be freely chosen as earlier (compare (15.13)). To reduce the number of virtual 
displacements to the number of independent displacements, we introduce the—for 
the present—freely chosen Lagrange multipliers A,/. In the general case, the Lagrange 
multipliers A/ with l = 1 ,2 ,... ,s are functions of the time and of the q v and q v . 
Virtual displacements 8 q v are performed at fixed time, i.e., with 8 t = 0. Then (16.1) 
changes to 


n 



v=l 
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These are also called instantaneous (belonging to a fixed time) constraints. This in 
turn leads to 

5 n 

y = o 

1=1 v=l 


or 


y ' ( y ] h^iv \&civ —0* 


(16.3) 


V = 1 \l= 1 / 

Equation (16.3) is now subtracted from (16.2): 


A/ d dT dT A \ 


.,s,...n. (16.4) 


/=! 


These equations involve in total ft of the variables q v \ s of them are dependent q v 
which are connected with the independent ones through the constraints, and n — s are 
independent q v . For the dependent q v , the index v shall run from v = 1 to v = s, for 
the independent q v from v = 5 + 1 to v = ft. The coefficients of the 8 q v in (16.4) 
are such that through the s Lagrange multipliers A./ (/ = 1,..., s) they can be chosen 
as freely as allowed by the s equations for the constraints. Since the A,/ can take any 
value, we can choose them in such a way that 


A d dT dT 

>*ZflZv — “77 7^ 7 Qv 

y—j' dt dq v dq v 


(v = 1, ...,£), 


i.e., the first s coefficients in (16.4) that correspond to the dependent q v are set to zero: 
d dT dT 


dt dq v 3 q v 


- Qv - ^A./fl/ v =0 for v = l,...,s. 


/ 


From (16.4) then remains 


E 

v=s+l 


d dT 
dt dq v 


dT 

dq v 


Qv J2kaiv)8q v —0. 


These 8 q v (for v = s + 1,..., ft) are no longer subject to constraints. This means that 
these 8 q v are independent of each other. One then must set the coefficients of the 8 q v 
(v=s + l,...,ft) equal to zero, just as in the derivation of the Lagrange equation for 
holonomic systems. 

This leads, together with the s equations for the dependent q v , to n equations in 
total: 


d dT dT A 

— —--- Qv~} A/ft/ v =0 for v = 1,..., s, s + 1,..., ft. (16.5) 

dt 3 q v 3 q v 


For conservative systems, the Q v can be derived from a potential: 
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As in the derivation of the Lagrange equation for holonomic systems, we can re¬ 
formulate (16.5) with the Lagrangian L = T — V as follows: 


d 3L 
dt 3 q v 


3 L 

dq v 


s 

^A,/ 0 / v = 0 , v = 

l=l 


(16.6) 


These n equations involve n + s unknown quantities, namely the n coordinates q v 
and the s Lagrange multipliers A./. The additionally needed equations are just the s 
constraints (16.1) which couple the q v \ however, these are now to be considered as 
differential equations: 


^ ~2 a i v q v +ai t = 0, 7 = 1,2 ,...,^. 

V 


Thus, we have in total n + s equations for n + s unknowns. We thereby obtain both 
the q y we were looking for, and also the s quantities A./. 

To understand the physical meaning of the A./, we assume that the constraints of the 
system are removed, but are replaced by external forces Q* which act in such a way 
that the motion of the system remains unchanged. The equations of motion would 
then also remain the same. These additional forces must be equal to the constraint 
reactions, since they act on the system in such a way that the constraint conditions are 
being fulfilled. With regard to these forces Q% the equations of motion read 


d 3 L 3 L 
dt 3 q v 3 q v 


(16.7) 


where the g* enter in addition to the g v . The equations (16.6) and (16.7) must be 
identical. This leads to 


= (16.8) 

/ 

i.e., the Lagrange multipliers A./ determine the generalized constraint reactions g*; 
they will not be eliminated but are part of the solution of the problem (see also the 
statements in Chap. 17 on this topic). The relation (16.3) thus changes to 

Y J Q*M V = 0, (16.9) 

V 

implying that the total virtual work performed by all constraint reactions vanishes. 
This can be considered as the general proof of the thesis introduced in (15.3), that 
constraint reactions do not perform work. 

EXAMPLE 


16.1 Cylinder Rolls down an Inclined Plane 

As an example of the method of Lagrange multipliers, we consider a solid cylinder 
that rolls down without gliding on an inclined plane with height h and inclination 
angle a . This rolling condition is a holonomic constraint, but this is immaterial for the 
demonstration of the method. 



Fig. 16.1. A cylinder rolls with¬ 
out gliding on an inclined plane 
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Example 16.1 

The two generalized coordinates are s, <p. The constraint reads 

Rep = s or Rd<p — ds = 0. 

These equations can, of course, be integrated immediately, and with Rep = s -\- 
constant, the constraint is holonomic. But we stick to the differential form of the con¬ 
straints and demonstrate the method of Lagrange multipliers. In this way we even find 
the constraint reactions. 

The coefficients occurring in the constraint are 

a s = — 1 , a^p — R, 

as is seen by comparison of coefficients with (16.1): 

^2ai v 8q v =0, 
y 

where / = 1 is the number of constraints, and 8 t = 0. The kinetic energy T can be 
represented as the sum of the kinetic energy of the center-of-mass motion and of the 
kinetic energy of the motion about the center of mass: 

t 1 -2_l m (-2 , R 2 2\ 

T = 2 ms + _©* = y 

with the mass moment of inertia of the solid cylinder 

1 2 

®solcyl — ’ 

The potential energy V is 

V = mgh — mgs sin a. 

The Lagrangian reads 

m ( . 9 R 2 . 9 \ 

L — T — V = — ls z + — </ 9 Z 1 — mg(h — s sin a). 

One should note that this Lagrangian cannot be used directly to derive the equation of 
motion according to (15.17). The reason is that the two coordinates s and ep are not 
independent of each other. Thus, ep is not an ignorable coordinate, although it does not 
explicitly appear in the Lagrangian. 

Since there is only one constraint, only one Lagrange multiplier A is needed. With 
the coefficients 

a s = — 1, a^p = R, 

we obtain for the Lagrange equations 

ms — mg sina + A = 0 , (16.10) 

t 2-R 2 (p-\R= 0, (16.11) 



16 Lagrange Equation for Nonholonomic Constraints 


305 


which together with the constraint 
Rep — s 


Example 16.1 


(16.12) 


represent three equations for three unknown quantities q>,s,k. Differentiation of 
(16.12) with respect to the time yields 

Rip = s. 

From this, it follows, together with (16.11), that 
ms = 2 k. 

Hence, (16.10) changes to 
mg sin a = 3k. 

From this equation, we obtain for the Lagrange multiplier 


1 

k = -mg sin o'. 

The generalized constraint reactions are 




Here, a s k is the constraint reaction caused by the friction; a^k is the torque generated 
by this force, which causes the rolling of the cylinder. One should clearly understand 
that the constraint “rolling” demands a particular constraint reaction (friction force). 
We have evaluated it here. We further note that the gravity is reduced exactly by the 
amount of the constraint reaction a s k. 

Inserting the Lagrange multiplier k into (16.10), we obtain the differential equation 
for s: 



The differential equation for ep is obtained from this by inserting 
Rip = s, 

.. 2 g . 

ip — -sin a. 

Y 3 R 

We have seen by this example that the method of Lagrange multipliers yields not only 
the desired equations of motion but also the constraint reactions, which otherwise do 
not appear in the Lagrangian. 

EXERCISE 

16.2 Particle Moves in a Paraboloid 

Problem. A particle of mass m moves without friction under the action of gravitation 
on the inner surface of a paraboloid, which is given by 
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2 , 2 

x +y =ax. 

(a) Determine the Lagrangian and the equation of motion. 

(b) Show that the particle moves on a horizontal circle in the plane z = h, provided 
that it gets an initial angular velocity. Find this angular velocity. 

(c) Show that the particle oscillates about the circular orbit if it is displaced only 
weakly. Determine the oscillation frequency. 


Solution, (a) The appropriate coordinates are the cylindrical coordinates r,ep,z. The 
kinetic energy expressed in cylindrical coordinates reads 

T = -m(r 2 + r 2 cp 2 + z 2 ). 

Hence, the Lagrangian is 


L = 


-mix 2 + r 2 (p 2 + z 2 ) - mgz. 


(16.13) 


The constraint is x 2 + y 2 = ax. Since x 2 + y 2 = r 2 , we have r 2 — az = 0, or in 
differential form, 2r8r — a8z = 0. 

Adopting the notation r = q\, ep = q 2 , z = q$, from 


'y ' A a q a — 0 

a 


we find that A\ — 2 r, A 2 = 0, A 3 = —a. 
The Lagrange equations read 


d ( dL \ 
dt \ dq a ) 


- — =k\A a , a = 1,2, 3; 
dqot 


i.e., 


d (dL\ 
dt\dr ) 

d / dL\ 
dt\d(p) 



1L 


dL 

dcp 


= 0, 


and 



With (16.13), we obtain 


m(r — rep 2 ) = 2k\r, 


d 9 . 
m — (r (p), 
dt 


m'z = —mg — X\a, 


(16.14) 


(16.15) 


and the constraint 2 rr — az = 0. From this system, we can determine r,cp,z,k 1 . 

(b) The radius of the circle arising by intersection of the plane z = h with the 
paraboloid is r 2 = az, 

ro = Vah. 
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From mi = —mg — \\a, it follows with z — h that 


A = mg 
a 


From m(r — rep 2 ) = 2Air, it follows with ep = co,r = ro that 


m(—r()CD 2 ) = 21 — 




2 2 S 
ro or co = —; 

a 


i.e., 


co = 

is the desired initial angular velocity. 

(c) From md(r 2 ep)/dt = 0, it follows that r 2 ep = constant = A. We suppose that the 
particle has the initial angular velocity co\ i.e., 

ah co 

A — aheo , and therefore cp = —. 

r z 

Since the particle oscillates about z = h with only small amplitude, we use Ai = 
—mg/a, which holds for z — h, and we obtain 



m(r — rep 2 ) = — 


2 mg 

-r 

a 



2 gr_ 
a 


Since the oscillation is small, we have r = ro + u; i.e., 


u — 


a 2 h 2 co 2 
(ro + u ) 3 


2 g 

-(r 0 + m). 

a 


We have 
1 


(ro + u) 3 r^(l + u/ ro) 3 rQ 


(16.16) 


u 

1 + - 
r 0 


-3 


(i--)i 

V ro ) r^ 


since w/ro 1 (power series expansion)! 

Thus, from (16.16), we obtain with ro = \[ah, co — +J2g/a, the differential equa¬ 
tion 


u H- u = 0 

a 

with the solution 


%g t _, ■ / 8 g 

U — 8\ cos J —t + £2 sin J — t , 
y a y a 


(16.17) 


Exercise 16.2 


and thus, 
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i.e., r oscillates with co 2 = 8 g /a about the equilibrium value ro = \fah. The oscilla¬ 
tion period is 


T 0 = 7t 

V 2 g 

while the orbital period is 


T u = 2 tt — — 2Tq. 
2 8 


EXERCISE 


16.3 Three Masses Coupled by Rods Glide in a Circular Tire 


Problem. Three mass points mi, m 2 , m 3 are fixed to the ends of two massless rods 
and glide without friction in a circular tire of radius R , which stands vertically in the 
gravitational field of the earth. Find the equations of motion by means of Lagrange 
multipliers, and determine the equilibrium position. Find the frequency of small oscil¬ 
lations about the equilibrium position. 



Solution. We use the angles (p\,cp 2 , and ^3 as generalized coordinates. The angles 
are not independent of each other, but are coupled by the rigid rods connecting the 
mass points, via the constraints 


(p?> — cp2 = a = constant, 
cp 2 — (pi = p = constant. 

In differential form, ai v 8q v = 0, they read 

8(p?> — 8cp2 = 0 , 

8(p2 ~ 8(p\ = 0 . 


(16.18) 


(16.19) 


The Lagrangian of the system can be immediately given in these coordinates: 


L = Y j T v -V v 

V 

= I 'Y2 m vR 2( f> 2 - y~Vvg/?(l - cos<p„). (16.20) 

V V 

The Euler-Lagrange equations, generalized to nonintegrable constraints, i.e., con¬ 
straints that are given only in the form (16.19), can be formulated by means of the 
Lagrange multipliers A/ (16.6): 
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The number of constraints in the case considered here is s = 2. We thus obtain the 
three equations of motion: 

m v R 2 cp v + m v gR sin cp v = X\a\ v + v = l,2, 3. (16.22) 


From (16.19), we obtain by comparison the coefficients ai v : 


an=0, Cl\2 = —1, a 13 = 1. 

Cl2 1 = — 1 , <222 = 1, <223 = 0. 

Then (16.18) implies that 


V3=V2 = Vl' 


(16.23) 


(16.24) 


By inserting this into (16.22), we get 


g . . *2 

-sin^i + —-2 
7? m\R z 


g . . m 

= fi smK+ ^ 


^2 g . M 

- 3 - = — sin cc >3 - 7 T 

m 2 R 2 R m 3 R 2 


(16.25) 


Solving for X\ and A 2 leads to 


F 


m 3 Af 


sin^i) + m 2 (sin <^3 


sin ^ 2 )], 


^2 

F 


= mi ^t m 2(sin^2 — sin^i) 


+ m 3 (sin <^3 — sin^i)]. 


(16.26) 


Next, we set M = m\ + m 2 + m 3 and &>q = g/R. The angles <^3 and <^2 can be ex¬ 
pressed by (p\ via the constraint (16. 1 8), so that one differential equation in the variable 
<p\ describes the entire system. Hence, from (16.22) and (16.26) we obtain 


cp\ = — ^\ mi + m 2 sin ^2 + ^3 sin ^ 3 ] 


=-- [mi sin (p\ + m2 sin(c?i + B) + m3 sin(c?i + a + p )]. 

M 


(16.27) 


The equilibrium position is at the point of vanishing acceleration cpi = 0: 


m \ sin<pi + m 2 (sin<pi cos/I + cos^i sin ft) 

+ m 3 [sin cp\ cos(a + ft) + cos cp\ sin(a + P)\ = 0. 


Solving for cp\ yields 


^il<^i=o = ^i = arctan 


m 2 sin P + m 3 sin(a + P) 
m 1 + m 2 cos p + m 3 cos(o' + /3) 


(16.28) 


(16.29) 


We now consider small vibrations # about the equilibrium position determined by 
(16.29): 


Exercise 16.3 


cp l =(p® + $ with |#|<Cl. 


(16.30) 
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By means of the addition theorem sin(© + ft) ~ sin© + #cos© and <p\ = we 
obtain from (16.27) the desired frequency: 

GO 1 

& = [mi cos (p® + m 2 cos(<^ + a) + m 3 cos((p® + a + /3)]$, 

V = -0, 2 V. (16.31) 

For small amplitudes, this differential equation describes the vibrations of a physi¬ 
cal pendulum. For a = /3 = 0, and, hence, (p® = 0, it turns into the equation of the 
mathematical pendulum. 
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17 


17.1 Velocity-Dependent Potentials 


So far, we defined conservative forces F by the condition that they can be derived from 
a potential V (r) by forming gradients, i.e., 

F(r, t) = —V V(r, t). (17.1) 


The potential V (r, t) is a function of the position and in general also of the time. This 
is possible as long as the forces do not depend on velocities or accelerations. There are, 
however, such cases: for instance, the Lorentz force which acts on a charged particle 
in the electromagnetic field is velocity-dependent: 

E+^xB^j. (17.2) 



Here, e is the charge of the particle, and E and B are the electric and magnetic field 
strength, respectively. F ^ indicates that this shall be an external force. 

If external forces depend on the velocity or the acceleration, we shall call them 
conservative as well if they can be expressed by a potential V that depends on the 
generalized coordinates qj , the generalized velocities qj and the time t , according to 


dV d dV 
dqj dt dqj 


(17.3) 


with V = V (qj , qj , t). 

In some cases, such a representation can be possible also for the ordinary coordi¬ 
nates Yi and the velocity v z -. The relation for V = V (r z , v z , t) analogous to (17.3) then 
reads 


F| = 


_v,v + -v T ,v = - 


dv d dv 

dri dt dVi ’ 


Here, 


V =(J- J_ J_) 

V ' \dv ix ’dv iy ’dv iz f 

means the gradient vector with respect to the components of the velocity of the ith 
particle. 

The velocity-dependent potential 


V — V(qj, qj, t) 


(17.4) 
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is sometimes called the generalized potential. We know from (15.14) that the kinetic 
energy T and the generalized forces Qj are related by 


d 3T dT _ 

dt dqj dqj 

Now using (17.3), we obtain 

d dT dT _ dV d dV 

dt dqj dqj dqj dt dqj 


(17.5) 


d dL dL _ Q 

dt dqj dqj 

if we define the generalized Lagrangian L by 


L — T — V 


with the generalized potential V (qj , qj , t). 

Sometimes, it is desirable to use another set of coordinates qj instead of the set 
of generalized coordinates qj . We now will show that this potential represents a gen¬ 
eralized potential also in the new coordinates qj and the related velocities qj . This 
property is therefore independent of the selected special coordinates. 

As in (14.7), the generalized forces Qj belonging to qj,cjj and the forces Qj from 
(17.3) are related by 


Qj 


3 N 

=Y.Qv 

v =l 


dqv 

dq v 


We have to show that 

~ dV d / dv\ 

Qi ^~W} + ~dt\Wj)' 

For the proof, we need the relation 

dqk _ dqk_ 

dqj dqj ’ 

which immediately follows from 



(17.6) 


(17.7) 


(17.8) 


(17.9) 
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Because 


dV d /dV 
dq v + dt\dq v 


we write the generalized force Qj (17.6) as follows: 


Qj = - 


^3 V dq v ^d jdV\dq v 

, 3#v dqj — \3^v / 3$} 

V=1 J v= 1 J 


^ dV d q v yW / dV 3 q v 
, 3 q v dqj , dt \3^ v 3 qj 

V=1 J V=1 J 


^dV d (dq v 
, 35v \3§} 

V=1 J 


By inserting this expression into (17.9), we get 


X. = _?L + Y'!!L(jL(x'!!±L*' , A 

7 9 ?/ Mv yqj dq a qa 3 t qv 

^djdVdq v \ ^ dV d fdq v \ 
dt \3<7i> dqj ) 3 q v dt \3 qj J 


The third term yields with (17.8) 


^d/dv dq. 
~[dt\dq v dqj 


q_v\ d/ dV 
?// dt\dq j 


Thus, Q: becomes 


3V d( dV 
Qj ~ ~ + “77 I 77*7“ 


dqj dt\dq 


, /_3_/^89v* 3 \\ 

9^v y3?v 3§a^“ 3^7) 


3iV ^ T/ / 3iV ^ / « 

3 V /3 /3g v 


_ V ( V (; / d dq v \ 

dqv dq a \dqj) qa dtdqjj' 


Since q a does not depend on qj , the last two terms cancel each other, and we find that 
(17.7) is valid. Thus, it has been shown that 


V(qj,qj,t ) = V(qj(qj,t),qj(qj,t),t) = V(qj,qj,t ) 


also represents a generalized potential in the new coordinates qj . 
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EXAMPLE 


17.1 Charged Particle in an Electromagnetic Field 

In the lectures on electrodynamics 1 we shall show that the electric field strength E and 
the magnetic field strength B can be derived from the scalar potential 0(r, t) and the 
vector potential A(r, t ), namely, 


1 3A 

E = —V O-, 

c 3 t 


B = V x A. 


(17.10) 


In other words, the electromagnetic phenomena can be described by O, A instead of 
E, B. Now we show that in the frame of the Lagrangian formalism the Lorentz force 
(17.2) can be described by the velocity-dependent potential 


V = <?0-A • v. 


(17.11) 


The Lagrangian then reads 


L — T 


1 2 
V = -m\ 
2 


e<& + -A • v. 

c 


We restrict ourselves to the Lagrange equation for the v-component: 
d dL dL _ q 

dt dv x dx 

The other components follow likewise. We calculate 


3L 30 e 3A 

— = —e -1-- v, 

dx dx c dx 


dL e 

-— =mv x + -A x , 
3 v x c 


(17.12) 


(17.13) 


and furthermore according to (17.13), 

e dAv 


d 30 e 3A 

mv x = -e- -h 


dt 


dx c dx 


v — 


c dt 


For the last term, we obtain 
dA x 


3 A x 3 A x dx 3 A x dy 3 A x dz 

dt dt dx dt 3y dt dz dt 


dAx 

dt 


+ 


dA* 

dx 


-v x + 


dy 


Vy + 


dz 


and for the intermediate term, 


3A 

dx 


v = 


dAx 

dx 


-Vx + 


3A^ 

dx 


'Vy + 


dA 7 


dx 


Equations (17.15) and (17.16) are now inserted into (17.14) and yield 


dmv x 

dt 


30 




dx 


1 dA^ 
c dt 


e 

+ - 
c 


dAy 

dx 


dAx 

dy 


e 

vy-~ c 


9Ax 

dz 


dAz 

dx 


(17.14) 


(17.15) 


(17.16) 


— e E x + (BxVy ByV z ) 


= el E + -v x B 


1 See W. Greiner: Classical Electrodynamics, 1st ed., Springer, Berlin (1998), Chapter 23. 
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Corresponding expressions are obtained for the y- and z-components, so that we get Example 17.1 

in total 

— (mv)= (E+ivxB), (17.17) 

dt \ c ) 

i.e., Newton’s equation of motion with the Lorentz force. 


17.2 Nonconservative Forces and Dissipation Function (Friction 
Function) 

So far, the discussion was restricted to conservative forces only. We now consider 
systems with conservative and nonconservative forces. Such systems are first of all 
systems with friction. They play an important role in classical physics and recently 
also in heavy-ion physics. If two atomic nuclei collide, many internal degrees of free¬ 
dom are excited; one can say that the nuclei are being heated up. Energy of relative 
motion is lost. This is a signature for friction forces, which are generally considered 
as being responsible for the energy loss. 



(a) 




Fig. 17.1. (a) Trajectory of 
nucleus 2 in the Coulomb field 
of nucleus 1. (b) Trajectory of 
nucleus 2 in the Coulomb plus 
nuclear field of nucleus 1 


We begin our discussion of nonconservative (e.g., friction) forces with the La¬ 
grange equations in the form 


d dT 
dt dqj 





(17.18) 


(c) 

and split the generalized forces Qj in a conservative part Q ■ and a nonconservative 
part Q ( P (/ for friction): 


Qj = Q ( f + Q\ f) . 


(17.19) 


(c) 

Since Q\ can be derived by definition from a potential according to (17.3), we can 
introduce L — T — V and bring (17.18) into the form 

d dL dL 
dt dqj dqj 


y' = l,2,...,n. 


(17.20) 




316 


17 Special Problems 


If the nonconservative forces are friction forces, on the right-hand side appear only 

(f) 

these friction forces Q ■ . For these, we make the ansatz 

n 

Q { P = -£/;*%, (17-21) 

k=l 

where the fjk are the friction coefficients. If the friction tensor fjk is symmetric, i.e., 
fjk = fkj, the friction forces Qj can be obtained by partial derivation with respect 
to the generalized velocities qj from the function 

1 n 

D =~J2 Mm (17.22) 

z k,l=1 


according to 



dD 

Hj' 


D is called the dissipation function (friction function). The Lagrange equations 
(17.20) can now be written as 


d dL dD dL 
dt dqj dqj dqj 


(17.23) 


In order to understand the physical meaning of the dissipation function, we calculate 

(f) 

the work performed by the friction force Qj per unit time, 


dW (r) 
dt 


T,qT*j 


T,fjkWk- ^D, 


(17.24) 


i.e., the energy consumed by the friction force per unit time is twice the dissipation 
function: 

dE d 

— = —(T + V) = -2D. (17.25) 

dt dt 

This can also be directly derived from the Lagrange equations: 


d 

dt 


v-^ dT . v-^ 

<r + v, = £ 


dT .. d 

— 4i + — v. 
dq t dt 


(17.26) 


With (17.23) we find 


^ dT .. _ d 
dqi dt 



d 

dt 


(2 T) + 

i 



v—> dT . ^\ dV . 

~^ qi + 2 ^~ qi 
. Hi Hi 


d 

dt 


dT 


(2T)+2D-Y—qi + 
“ Hi 



(17.27) 
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By inserting this into (17.26), we obtain 
d(T + V) = d(2T + 2V) | ^ 
dt dt 


or 


dE 

dt 


= -2D, 


i.e., the result (17.25). 


EXAMPLE 


17.2 Motion of a Projectile in Air 

The particle shall be under the action of the conservative gravitational force with the 
potential 

V = mgz 

and the nonconservative friction resistance of air. The air resistance depends on the 
projectile velocity. We suppose the friction force to be proportional to the velocity. It 
can then be derived from the dissipation function 

D = ^a(x 2 + y 2 + z 2 ), 

and the Lagrange equations follow with L = (1/2 )m{x 2 + y 2 + z 2 ) — mgz according 
to (17.23) as 

mx + ax = 0, my + ay = 0, mi + az + mg = 0. 

These equations of motion are known from the lectures on classical mechanics . 2 


17.3 Nonholonomic Systems and Lagrange Multipliers 

In the preceding text, we have already discussed holonomic and nonholonomic sys¬ 
tems. A brief recapitulation seems appropriate: For holonomic systems, the supple¬ 
mentary conditions can be expressed in the closed form 

gi(r v , t) = 0, 1 = 1,2,...,$, v = l,2 ,...,#. (17.28) 

N = number of particles. We therefore can eliminate s coordinates and express the 
r v as functions of n = 3 N — s independent generalized coordinates qt . For nonholo¬ 
nomic systems, this is not possible, since the supplementary conditions appear in the 
differential form 

N 

y]giz(r v ,0 -dri +g it (r v ,t)dt = 0, i = l,2,...,s. (17.29) 

l=i 


2 See W. Greiner: Classical Mechanics: Point Particles and Relativity, 1st ed., Springer, Berlin 
(2004), Chapter 20. 
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Since these equations shall be nonintegrable, one cannot eliminate s dependent co¬ 
ordinates from them in the form (17.29). One therefore simply expresses the r* as 
functions of 3 N generalized coordinates qt. The qt are of course not all independent, 
but are subject to supplementary conditions which are obtained by rewriting (17.29) 
in terms of the qt : 

3 N 

Yan(q, t)dqi +a it (q,t)dt = 0, i = 1,2, ...,s. (17.30) 

1=1 

For virtual displacements 8 qi , i.e., 8 t = 0, these supplementary conditions change to 

3 N 

Yan(q, t)8qi =0, i = 1, 2 ,..., s. (17.31) 

1=1 

In this form, the supplementary conditions can be combined with the Lagrange equa¬ 
tions in the same form, namely, 



(17.32) 


The conservative forces were taken into account in the Lagrangian L. Because of 
the conditions (17.31), not all 8 qt in (17.32) are independent. To take this fact into 
account, one multiplies in (17.31) by the—at the moment still unknown—factors ki 
and sums up over i , 


5 3N 

EE kian(q,t)8qi =0. (17.33) 

i=l l=l 

Addition of (17.32) and (17.33) then yields 



d dL 
dt dqj 


T ^ ' kidij{q , t) j8 qj — 0. 


i=l 


(17.34) 


The factors ki are called Ldgmnge multipliers. They can be chosen arbitrarily in 
(17.34). Among the 3 N quantities 8 qj, however, only 3 N — s can be chosen arbitrar¬ 
ily, since the s supplementary conditions (17.31) still must be satisfied. We number 
the 8 qj so that the first s of them are just the dependent ones; the last (3 N — s ) of the 
8 qj can be freely chosen. 

Now we utilize the free choice of the s Lagrange parameters ki , which are deter¬ 
mined in such a way that the coefficients of the first s variations 8 qj in (17.34) vanish. 
This obviously leads to the s equations 


(r) dL d dL A 

Qj -—f Ykjdjjiq, t) — 0, j — 1,2,.. 

J dqj dt dqj “ 


• ,s. 


and (17.34) reduces to 


y- / Q(r) , 9L 

i^lrl \ 1 dq j dt dq j 


Xiaij(q,t ) \Sqj =0. 


(=1 


(17.35) 


(17.36) 
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In (17.36), all of the 8 qj can now be freely chosen. Therefore, the expression in the 
round bracket must vanish for every individual /; i.e., it follows that 

e ( r)+ 9 L_^ 9 L + y. =Q , j=s + hs+2t . 3#. (17.37) 

J dqj dt dqj ^ 

j j i = i 

Now we see that the two sets of (17.35) and (17.37) have the same form and can be 
simply combined to 

dL d dL (r) A 

— --— + Qy + YXi a ij(q,t)=0, j — 1,2,, 3N. (17.38) 

dqj dt dqj J ^ 

These are 37V equations which, together with the s supplementary conditions in the 
form 

3 N 

y^aniq, t)qi +a il (q,t) = 0, i — \,2,...,s, (17.39) 

1=1 

determine the 3N + s unknown quantities, namely the 37V coordinates qj and s La¬ 
grange multipliers A 4 . Hence, the total number of desired quantities (qj , A./) is 37V -\-s. 
This is also the number of (17.38) and (17.39) which determine these quantities. 

The meaning of the Lagrange multipliers can be understood even more precisely if 
we interpret the last term in (17.38) as an additional force Q,J\ namely, 

Qf =y>a, 7 (<l,0. (17.40) 

i =1 

These forces are constraint reactions which appear since the motion of the sys¬ 
tem is restricted by supplementary conditions. Indeed, if the supplementary conditions 
disappear (aij = 0), the constraint reactions also vanish; = 0. The former equa¬ 
tion (17.33) can now be written as 

3 N 

l]e ( a) %= 0 (17.41) 

i =1 

and can be interpreted as the vanishing of the virtual work of the constraint reactions. 

It is clear that the method of Lagrange multipliers developed here for nonholonomic 
systems can be applied to holonomic systems, too. The holonomic constraints (17.28) 

gi(r v , t) = 0, i = 1,2, ...,s, v = 1,2,..., N, 

can immediately be written in differential form: 

'y — -dri + —dt = 0, i = l,2,...,s. (17.42) 

hi dri dt 

This is exactly the form (17.29) for nonholonomic systems. From now on, the ap¬ 
proach with Lagrange multipliers can run on as explained above. We then obtain 
(37V + s) coupled equations, while the former solution method for holonomic sys¬ 
tems (based on the elimination of s coordinates from (17.28)) leads only to (37V — s) 
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coupled equations. By the additional 2s equations the procedure now became much 
more complicated. However, this complication has also a great advantage: We now 
can determine the constraint reactions according to (17.40) without difficulty (by 
solving the 3N + s equations). 


EXERCISE 

17.3 Circular Disk Rolls on a Plane 


Problem. Determine the equations of motion and the constraint reactions of a cir¬ 
cular disk of mass M and radius R that rolls without gliding on the x, y-plane (see 
Fig. 17.2). The disk shall always stand perpendicular to the x, y-plane. 


Fig. 17.2. A circular disk rolls 
on the x,y-plane 



Solution. We first consider how to mathematically formulate the constraints “without 
gliding” and “always stands perpendicular to the x, y-plane.” This actually means 
that the center of the disk is exactly above the contact point (x, y) (the disk stands 
perpendicular), and the velocity of the circumference of the disk edge equals the 
velocity of the contact point in the x, y-plane (O is the rotation angle of the disk 
around its axis). The latter means that there is no gliding. If we introduce the angle 
© between the disk axis and the x-axis (see Fig. 17.3), the condition “no gliding” 
mathematically reads 

x = 7? 6 sin ©, y = —R<& cos©. (17.43) 

In another formulation, these differential supplementary conditions read 
dx — Rsin@d<& — 0, 

(17.44) 

dy + R cos ©<id> = 0. 



Fig. 17.3. Projection onto the 
x, y-plane 
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In the form (17.30), these conditions thus read 


Exercise 17.3 


a\\dx + ayidy + $13 JO + aud © = 0, 
a2\dx + d2idy + $23dd> + a 2A d® = 0, 


where 


$n = 1, ci \2 = 0, $13 = —R sin ©, $14 = 0, 

$21=0, <222 = 1, $23 = R COS ©, $24 = 0. 


Thus, according to (17.40) the constraint reactions are 



(17.45) 


The kinetic energy of the disk is 



(17.46) 


where I\ is the moment of inertia of the disk about the axis perpendicular to the disk 
through the center, and h is the moment about the axis through the center and the 
contact point (x , y ). 

The Lagrange equations (17.38) now read explicitly 

Mx = Q x + M, 

My = Qy + 7-2, 


(17.47) 


/iO = go — A.i/?sin© + A.2 ^cos©, 

/ 2 © = g©. 

g*, gy> g</>, g© are possible external forces. We study the case without such forces 
and therefore set them equal to zero. This transforms (17.47) into 

Mx = X \, 

My = A.2, 


(17.48) 


7iO = — Ai^sin© + A^cos©, 
/ 2 © = 0 , 


which must be replaced by (17.43) according to (17.39): 


x = R<& sin ©, 
y = — R<$> cos©. 


(17.49) 


The last equation of (17.48) can be immediately integrated, leading to 


© — cot T - ©o* 
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Exercise 17.3 


By inserting this into (17.49), one can calculate x and y, which determine Ai and A 2 
through the first two equations of (17.48): 

Ai = Mx = M(R & sin(cot + © 0 ) + cdR<& cos (cot + © 0 )), 

(17.50) 

A 2 = My — — M(R&cos(cot + © 0 ) — coR<& sin (cot + © 0 )). 

This in turn is now inserted into the third equation (17.48), which then reads 

7i<f> = — MR(R& sin(cot + © 0 ) + coR® cos(cot + ©o))sin&tf 
— MR(R& cos(a)t + © 0 ) — cdR< i> sin(cot + © 0 )) cos cot 
= -MR 2 < E>; 

i.e., 

(/l +MR 2 )$> = 0. 

This leads to <f> = 0 and hence 6 = constant. Therefore, we can explicitly write down 
the constraint reactions (17.45): 

= McoR<& cos((jot + © 0 ), 

Qf = Ma>R<t> sin(cot + © 0 ), (17.51) 

Q$ = 0 , Q ( i } = 0 . 

These constraint reactions must act to keep the disk vertical on the x,y-plane. If the 
disk rolls along a straight line (co = 0), the constraint reactions disappear. 


EXERCISE 


17.4 Centrifugal Force Governor 


Problem. Consider the degrees of freedom, and determine the equation of motion 
of the centrifugal force governor (Fig. 17.4) through the Lagrangian. 
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Solution. The principle of the central force governor is applied, e.g., in automobiles. 
The distributor drive shaft is tightly fixed to the carrier plate of a central force gov¬ 
ernor which is attached below the interrupter plate. At higher speeds the centrifugal 
masses press on their carrier plate against a “cog.” Thus, the distributor shaft set into 
the driving shaft is moved additionally in the rotation direction by a cam. This mech¬ 
anism causes a preignition needed at higher speeds. For more advanced motors with 
“transistor ignition,” this mechanism is dropped. 


spring 

centrifugal masses 
cog 

carrier plate 

The system has two degrees of freedom, which can be described by the angles 0 
and ip. The motion of m , M is restricted by the constraints represented by the four rigid 
rods and the rotation axis. Hence, 0 and ip offer themselves as generalized coordinates. 
We first determine the kinetic energy. The moment of inertia of the cylinder is 

1 ? 

©zz = -MR 2 , 

and therefore, 

T mt = \(\ mr2+ 2ml2sin2e j ( P 2 - (17.52) 

The kinetic energy due to the motion in the x, y-plane is 

m 9 1 9 

Tplane = 2 y V m + ~Mv m 

d 

v m =lO, vm = —(—2/ cos 0) = 021 sin 0 
dt 

From this, it follows that 

T plane = (m + 2M sin 2 O)l 2 0 2 . (17.55) 

With the potential energy V = —2 gl/(m + M)cosO, we can write down the La- 
grangian: 

L = ^rot Fplane — V 

= ^ (©zz + 2 ml 2 sin 2 0)ip 2 + (m + 2 M sin 2 0)l 2 0 2 

+ 2gl(m + M) cos 6 . (17.56) 

The Lagrange equations 

d dL 3L _ Q 
dt 


(17.53) 

(17.54) 



Exercise 17.4 


Fig. 17.5. Centrifugal gover¬ 
nor in automobiles 
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Exercise 17.4 


immediately yield the equations of motion: 

= 0 , 

= (©zz + 2m/ 2 sin 2 6)<p, 

= 2ml 2 sin 0 cos Ocp 2 + 4Msin0 cos 0l 2 0 2 — 2gl(m + M) sin0, 
= 2 (m + 2 M sin 2 0)l 2 0 . 

From (17.57), we get 

l(^) = l [( ® zz+2m,2sin2wi=a 

For the Lagrange equation in 0 , we need 

— (| = (2m + 4Msin 2 0)/ 2 $ + 8Msin# cos0l 2 0 2 . 
dt\d0j 

Then 


dL 
dcp 
dL 
d (p 
dL 

~d6 

dL 

~d0 


(17.57) 


(17.58) 


(17.59) 


(2m + AM sin 2 0)l 2 0 + 4Ml 2 0 2 sin 0 cos 0 
— 2ml 2 <p 2 sm0 cos0 + 2 gl(m + M) sin0 =0, 


(17.60) 


and hence, we obtain the following equations of motion: 

(2m + 4 M sin 2 0)l 2 0 + 2l 2 (2M6 2 — mcp 2 ) sin 0 cos 6 + 2 gl(m + M) sin^ = 0, 

= 0 (17.61) 


d 

dt 


MR 2 + 2m/ 2 sin 2 0 \<p 


C 


(p: 


(1/2)MR 2 + 2m/ 2 sin 2 6 


From these equations of motion, the advantage of the Lagrangian formalism becomes 
evident. To account for the complicated constraint reactions in Newton’s formulation 
would be much more laborious. 









Hamilton's Equations 



The variables of the Lagrangian are the generalized coordinates q a and the accom¬ 
panying generalized velocities q a . In Hamilton’s theory, 1 the generalized coordinates 
and the corresponding momenta are used as independent variables. In this theory the 
position coordinates and the “momentum coordinates” are treated on an equal ba¬ 
sis. Hamiltonian theory leads to an essential understanding of the formal structure of 
mechanics and is of basic importance for the transition from classical mechanics to 
quantum mechanics. 

We now look for a transition from the Lagrangian L{qi , cp , t) to the Hamiltonian 
H(qi , pi , t) and remember that the generalized momenta are given by 



We look for a transformation 


L(qi,qt,t ) => H^qi,^-,t^= H{q t , pi,t). (18.1) 

The question is, how to construct HI The recipe is simple and will be formulated in 
the following equation (18.2). The mathematical background of such a transformation 
C Legendre 2 transformation) can be easily demonstrated by a two-dimensional exam¬ 
ple. We change from the function f(x,y) to the function g(x,u) = g(x, df/dy ): 

df 

f(x,y) => g(x,u) with Li = —, 

By 

where g(x, u) is defined by 
g(x, u) = uy - f(x,y). 


1 Sir William Rowan Hamilton , b. Aug. 4, 1805, Dublin-d. Sept. 2, 1865, Dunsik. Hamilton began 
his studies in 1824 in Dublin. In 1827, before finishing his studies, he became professor of astronomy 
and King’s astronomer of Ireland. Hamilton contributed important papers on algebra and invented 
the quaternion calculus. His contributions to geometrical optics and classical mechanics, e.g., the 
canonical equations and the Hamilton principle, are of extraordinary importance. 

2 Adrien Marie Legendre , b. Sept. 18, 1752-d. Jan. 10, 1833, Paris. Legendre made essential contri¬ 
butions to the foundation and development of number theory and geodesy. He also found important 
results on elliptic integrals, on foundations and methods of Euclidean geometry, on variational cal¬ 
culus, and on theoretical astronomy. For instance, he first applied the method of least squares and 
calculated voluminous tables. Legendre dealt with many problems that Gauss was also interested 
in, but he never reached his perfection. Beginning in 1775, Legendre served as professor at various 
universities at Paris and published excellent textbooks which had a long-lasting influence. 


W. Greiner, Classical Mechanics , 

DOI 10.1007/978-3-642-03434-3_18, © Springer-Verlag Berlin Heidelberg 2010 
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18 Hamilton's Equations 


By forming the total differential, we realize that the function g formed this way no 
longer contains y as an independent variable: 


dg = ydu + udy — df 

df 9/ 

= ydu + udy - dx - dy 

dx dy 

9/ 

= ydu - dx, 

dy 


where now y = dg/du and dg/dx = —df/dx. 

According to this short insertion, we now construct the Hamiltonian from the La- 
grangian. We write for the Hamiltonian 


H{qi,Pi,t) = 'Y^p i q i -L(qi,qi,t). (18.2) 

i 

We look for those equations of motion which are equivalent to the Lagrange equations 
based on the Lagrangian L. To this end, we form the total differential: 


dH = J2Pid4i +^4idpi ~ dL. 


(18.3) 


The total differential of the Lagrangian reads 
dL dL dL 

dL = 2_'—dq i + \'—dq i + —dt. (18.4) 

^ dqi ^ dqi 3 1 

We now utilize the definition of the generalized momentum, pi = dL/dcp, and the 
Lagrange equation in the form 

d dL 

— Pi -= 0. 

dt dqt 

Inserting both into (18.4) yields 

9L 

dL=) Pidq t + 2_ j Pi d( U + 


By insertion of dL into (18.3), it follows that 

d L 

dH = y^pidcn + yy^qtdpi - y^pidqi - ^ pidcp - ~^dt. 

Since the first and fourth term mutually cancel, there remains 

9L 

dH = 2_^qidpi - 2_^pidqi - —dL 

i i 

Therefore, H depends only on pi , qi , and t; thus, H = H(qi , pi , t), and we have 

dH v dH dH v . . dL 

dH = 2_^ —+ 2^ Xp dpi + ~dT dt = L qidpi — l—i Pi dq i ~ ~^ dt - 
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From this immediately follow the Hamilton equations : 


3 H 

dH 

dH 

dL 

dpi' 

Pi = -T—> 
dqi 


~~dt 


(18.5) 


They are now the fundamental equations of motion in this formulation of mechan¬ 
ics. The Hamiltonian H here plays the central role, similar to the Lagrangian L in 
Lagrange’s formulation of mechanics. This Hamiltonian H is constructed according 
to (18.2); but with the prescription that all velocities cp are expressed by the general¬ 
ized momenta pi and the generalized coordinates qi through (18.1). In other words, 
the equations (18.1) for the definition of the generalized momenta 


Pi = 


dL(qi,qi,t ) 
dqt 


are solved for the generalized velocities cp , so that 


qi =qi(qt, Pi). 


The cp obtained this way are inserted into the definition of H (see (18.2)), so that the 
Hamiltonian H finally depends only on qi , pi , and the time t\ hence, H = H(qi , pi , t). 
From this, the Hamilton equations (18.5) are established and solved. 

The Lagrange equations provide a set of n differential equations of second order in 
the time for the position coordinates. The Hamiltonian formalism yields 2 n coupled 
differential equations of first order for the momentum and position coordinates. In any 
case, there are 2 n integration constants when solving the system of equations. 

From (18.5), it is seen that for a coordinate that does not enter the Hamiltonian, the 
corresponding change of the momentum with time vanishes: 

dH „ 

-=0 => pi— constant. 

dqi 

If the Hamiltonian (the Lagrangian) is not explicitly time dependent, then H is a 
constant of motion since 

dH _ dH . dH . 3H 

dt ^ dqi ^ ^ dpi dt 


and with (18.5) this leads to 

dH _ dH 
dt dt 

Now it is clear that with dH/dt = 0 (since H shall not be explicitly time dependent) 
it follows that dH/dt = 0, and thus, H = constant. 

What is the meaning of the Hamiltonian; how can it be interpreted physically? 
To see that, we consider a special case: For a system with holonomic, scleronomic 
constraints and conservative internal forces, the Hamiltonian H represents the energy 
of the system. 

To clarify this, we first consider the kinetic energy: 

T = - ^m y r^, v = 1, 2,..., Af (N = number of particles). 
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If the constraints are holonomic and not time-dependent, there exist transformation 
equations r v = r v (qi), and therefore, 


r v 


i dqt 


Insertion into the kinetic energy yields 



— ^ ' ^ikQiQk- 
i,k 

Thus, the kinetic energy is a homogeneous quadratic function of the generalized ve¬ 
locities. The arising mass coefficients 


1 v 3r v 3r v 

a ik = - m v A ’ X 

2 “ dqi dqk 

are symmetric; i.e., a^ = a^. 

Now we can apply Euler’s theorem on homogeneous functions. If / is a homoge¬ 
neous function of rank n, i.e., if 

f(Xx\,kX2, • --,Xx k ) =X n f(xi,X2, 


then also 

k 


df r 

z-‘ x '^= nf - 


i—\ 


This can be shown by forming the derivative of the upper equation with respect to A; 
thus, 


9/ 

3(Ajci) 


-*! + ••• + 


9/ 

d(Xx k ) 


Xk = nX n 1 


/• 


By setting X = 1, the assertion follows. Euler’s theorem, applied to the kinetic energy 
(n — 2), means 


E 


dT 

dqi 


■ qt = 2 T. 


( 18 . 6 ) 


Since the forces are presupposed to be conservative, there exists a velocity-independent 
potential V(qi), so that 
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and therefore, 


E . v—"\ o i 

Pi<2i ~ L = 2 _^ 7-r-qi - L. 


By using the relation (18.6) and the definition of the Lagrangian, we see that 

H = 2T - (T - V) = T + V = E. 

Thus, under the given conditions the Hamiltonian represents the total energy. The 
energy T — V represented by the Lagrangian is sometimes called the free energy. 

One should note that H does not include a possible work performed by the con¬ 
straint reactions. 

The Hamiltonian formulation of mechanics emerges via the Lagrange equations 
from Newton’s equations. This became evident in deriving (18.5), where we explicitly 
used the Lagrange equations. The latter ones are however equivalent to Newton’s for¬ 
mulation of mechanics (see d’Alembert’s principle and following text). Conversely, 
one can easily derive Newton’s equations from Hamilton’s equations and thus show 
the equivalence of both formulations. It is sufficient to consider a single particle in 
a conservative force field and to use the Cartesian coordinates as generalized coordi¬ 
nates. Then 



or 



This leads to the Hamilton equations (qt = x{)\ 



or in vector notation 
p = —grad V. 

These are Newton’s equations of motion. 

EXAMPLE 

18.1 Central Motion 

Let a particle perform a planar motion under the action of a potential that depends 
only on the distance from the origin. It is obvious that we should use plane polar 
coordinates (r, cp) as generalized coordinates. 

L = T — V = i mv 2 - V = i m(r 2 + r 2 <p 2 ) - V(r). 
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With p a = dL/dq a , we get the momenta 


p r = — = mr or ir 
dr 


Pr 


m 


0Lj 2 • 

Pep = —r =mr <p or <p 



Thus, the Hamiltonian reads 



The Hamilton equations then yield 


_ _ Pr 


dH 
3 P<p 



d p r m 


and 



3 H _ P 2 V dV 



dr mr 3 dr ’ 



Fig. 18.1. On pendulum mo¬ 
tion 


cp is a cyclic coordinate. From this follows the conservation of the angular momentum 
in the central potential. 


EXAMPLE 


18.2 The Pendulum in the Newtonian, Lagrangian, and Hamiltonian Theories 

The equation of motion of the pendulum shall be derived within the frames of New¬ 
ton’s, Lagrange’s, and Hamilton’s theory. 

Newtonian theory: We begin with Newton’s axiom 

P = K. 

The arclength of the displacement is denoted by s, and the tangent unit vector by T. 
Then (see Fig. 18.1) 

K = —mg sin@T, 

and thus, 

ms T = —mg sin ©T. 

With 5 = /0, we have s = / ©. We therefore get the equation of motion 
© + y sin© = 0. 



Fig. 18.1. On pendulum mo¬ 
tion 
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For small displacements (sin © ~ © H-), this becomes 

© + I© = 0 . 

This differential equation has the general solution 


© = A cos / — t + B sin —t, 


where the constants A and B are to be determined from the initial conditions. 

Lagrangian theory: 

T = -mv 2 = -m(/©) 2 = -m/ 2 © 2 , 

2 2 2 

V = mgh = mg(l — l cos ©) = mgl (l — cos ©). 

Hence, the Lagrangian for this conservative system reads 

L = T -V = l -ml 2 & 2 - mgl( 1 - cos ©). 

Now we use the Lagrange equation 
d /3L\ 3L _ Q 

dt Va©/ 9© ” 

With 

dL dL 9 . 

a© 5 a© 

we have 

m/ 2 © + mg/ sin © = 0 or ©+ysin© = 0. 


Hamiltonian theory: Using the generalized momentum 

dL 2 . 

Ps = —- = ml ©, 

a© 

the kinetic energy can be written as 



Since the total energy of the system is constant, the Hamiltonian reads 

1 Ps 

H = T -\- V = - —% + mgl{ 1 - cos ©). 

2 ml z 

The Hamilton equations yield 


P® = ~ 


dH 

a© 


—mgl sin © 


and 


dH 

dps 


P® 
ml 2 


Example 18.2 
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The last equation gives 

ps = ml 2 ®. 

Differentiation yields 

ps = ml 0. 

By comparing this with the above expression for /?©, we finally get again 
0 + j sin0 = 0. 

EXERCISE 

18.3 Hamiltonian and Canonical Equations of Motion 

Problem. A mass point m shall move in a cylindrically symmetric potential V (q, z). 
Determine the Hamiltonian and the canonical equations of motion with respect to a 
coordinate system that rotates with constant angular velocity co about the symmetry 
axis, 

(a) in Cartesian coordinates, and 

(b) in cylindrical coordinates. 

Solution, (a) The coordinates of the inertial system (x,y,z) and those of the rotating 
reference system (V, y', z') are related by 

x = cos (cot)x' — sin (cot)y', 

y = sin (cot)x' + cos (cot)y', (18.7) 

z = z !. 

Derivation of the coordinates yields 

x = cos (cot)x' — sin (cot)y' — co(sin(cot)x' + cos (cot)y'), 

y = sin (cot)x + cos (cot)y' + co(cos(cot)x’ — sin (cot)y'), (18.8) 

z = z!. 

In the primed coordinate system, the Lagrangian takes the form 

L = ^ m[x 12 + y' 2 + z! 2 + o?(x' 2 + y' 2 ) + 2 oo(y l x l - i'/)] 

-V(x',y',z'). (18.9) 


From (18.9), we calculate the generalized momenta as 
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/ 3L / / 

Px = dl' =m(yX ~ My ^’ 

t 3L . . 

Py = ^y = m (y + 6 >X ), 

, dL 
3 z 


(18.10) 


Now we solve (18.10) for the velocity components x',y', z!\ 


x' = — + coy', 
m 


Py 

y — — — cox , 
m 

z ~ Pz 
m 


(18.11) 


and calculate the Hamiltonian according to 


h = y2^ ipi ~ L ■ 

i 

This yields 


(18.12) 


H = m(x z — cox y ) + m(y 1 + coy x ) + mz! z — L 
= - \lx rl — 2 cox' y' + 2y /2 + 2 coy' x' + 2 z! 2 

- (i /2 + y /2 + z /2 + o>V 2 + y /2 ) + 2co{y l x l - *'/))] + V 

= \m[x' 2 + y 2 + y 2 - o>V 2 + y 2 )] + y 


1 

= -m 
2 


r„/2 


/ 2 


CO 


Px , o ^ / / I 2/2 , ^ / / I 2/2 

-r+2-^yft)j + ^--2-x p -\-cox 


m 


m 


+ rt _ co 2 x , 2 _ co 2 y ,2 

m A 


+ y 


= 2-[p' x 2 + Py 2 + p?] - co[x'p' y - y'p'x] + v(Jx' 2 + y' 2 , z'). (18.13) 

H is explicitly time-independent and is therefore a constant of motion. The canonical 
equations of motion read 


./ dH 1 / , 

* = = ~Px + a, y * 

3 p' x m 

., i , 

y = —Py - MX , 
m 7 

1 , 

* = -Pz’ 


Exercise 18.3 


(18.14) 
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Exercise 18.3 


Px = 


Py = -VPx 


Pz = 


8H _ , dV 

dx' P y dx' ’ 

ay 
ay’ 
ay 

dz' 


(18.15) 


(b) For the transition to cylindrical coordinates, we differentiate the transformation 
equations 


x = q cos cp , y = q sirup 
with respect to the time: 


(18.16) 


X = Q COS (p — Q Cf) Sin (p , 
y' = q' sin ip' + q' cp' cos cp '. 

From (18.9) and (18.17), we calculate the generalized momenta: 
, _ dL _dL dx' dL dy' 

Pe ~~dp~ wip + dydp 

= Px COS( p' + Py sin<p', 

dL dy' 


dL dL dx' 


dcp' dx' dcp' dy' dip' 
= ~p' x q' sirup' + p' q' cos cp'. 


Now we solve for p' x and p' From (18.18), it follows that 


(18.17) 


(18.18) 


(18.19) 


, _ P Q -Py&n<p 

P X / ’ 

cos <y 

and from (18.19) (with (18.20)), 

_ p'f cos <p' + (p' 0 - p' y sin (p')q' sin cp' 

Py — 


y cos 2 cp' 
p'y (q' cos 2 <p' + q' sin 2 cp') 


q' cos 2 cp' 


P'cp cos cp' + p' Q Q sin cp' 
2~77 


£ r cos 2 cp 


1 


Py = y p[p cos (p r ~\~ p'q sin <p! 


Analogously, we obtain 
, 1 


Px =Po C0S( P 


-Pep sm <P • 


(18.20) 


(18.21) 


(18.22) 
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Now we insert (18.21) and (18.22) into (18.13) and obtain 

, 1 


H = 


r 2 2 / 

2 

P Q cos <P ■ 

- ~P 

- 

Q 

+ p ' e 2 sin 2 <p’ 

2 

+ - 


f ! 

0 Pep l 


I'e sin (p' p'p < 


6 


2 / / 2 
,2 *P P(p 


/ 1 / 9 2 / 

0 + “72 Pep cos <P 


1 

2m 


-co(x'p' -y f p f x ) + V(p',z') 


1 

2m 


+ ^lK> 2 + K 2 


cos cp'p' sin (io' + p r cos — p r cos 


— sin ff p cos sin cp r — p sin 

^ Q r * 


+ V (q' , z) 


1 

2m 




/ 2 
<P 



6>P'<P + V(Q f ,z). 


(18.23) 


A comparison of (18.13) and (18.23) shows that the Hamiltonian becomes especially 
simple if it is represented in coordinates adapted to the symmetry of the problem. 
From (18.23) we see that H does not depend on the angle (p r ((p' is a cyclic coordinate), 
hence the angular momentum component p f is a constant of the motion. 

The canonical equations of motion read 


1 


Q =-Po ’ 


m 


Po = 


J 2 


dv 


Q mp' 3 v dp' ’ 


<P = 

mp 

p'<p=° 


r2 y <P 


CO, 


1 , 

z = —P z , 

m 

av 


(18.24) 


18.1 The Hamilton Principle 

The laws of mechanics can be expressed in two ways by variational principles that 
are independent of the coordinate system. The first of these are the differential prin¬ 
ciples. In this approach, one compares an arbitrarily selected momentary state of the 
system with (virtual) infinitesimal neighbor states. One example of this method is the 
d’Alembert principle. Another possibility is to vary a finite path element of the sys¬ 
tem. Such principles are called integral principles. The “path” is not understood as the 
trajectory of a point of the system in the three-dimensional position space, but rather 
as the path in a multidimensional space where the motion of the entire system is com¬ 
pletely fixed. For a system with / degrees of freedom, this space is /-dimensional. 
In all integral principles the quantity to be varied has the dimension of an action 
(= energy • time); therefore, they are also called principles of minimum action. As 
an example we will consider the Hamilton principle. The Hamilton principle requires 
that a system moves in such a way that the time integral over the Lagrangian takes an 
extreme value: 


Exercise 18.3 
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shall have an extremum, which can also be expressed as follows: 

*2 

S J Ldt = 0. (18.25) 

h 

The path equation of the system can be determined by applying this principle. 

Before considering (18.25) in more detail, we will briefly deal in general with the 
variational problem. 


EXAMPLE 


18.4 A Variational Problem 


As an example for substituting a description in terms of coordinates by a description 
independent of coordinates, one can consider the definition of a straight line in the 
plane. The straight line is uniquely determined by fixing two of its points, and it can 
also be described by a linear equation between the coordinates x and y. It can further 
be described by the differential equation 



(18.26) 


with the further prescription that the values of the desired function y(x) for x = x\ 
and x = X 2 are given numbers. These are descriptions using rectangular coordinates. 
The straight line can however also be described as the shortest connection between 
two points, i.e., by 


J ds = minimum. 


(18.27) 


One may imagine the two given points as being connected by all possible curves, and 
among these curves that curve be selected which yields the minimum value for the 
given integral. This description of the straight line is independent of the choice of 
particular coordinates. 

As a preparation for the following, we show how the search for the shortest connec¬ 
tion between two points of the plane can be reduced mathematically to (18.26). After 
introducing rectangular coordinates x and y, the problem is to look for a function y (x) 
for which y(xi) and yfe) have given values and the integral 

X2 

1 = J +y>(x) 2 dx (18.28) 

XI 

takes a minimum value. Similar problems do not need to have a solution. So one 
could put the problem (18.27) or (18.28) and prescribe not only the start point and the 
endpoint, but also the direction of the curve at the start and endpoint, respectively. One 
easily recognizes that under these conditions there is no shortest connection, unless 
both of the given directions incidentally coincide with the straight connection. 
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The problem (18.28) has some similarity with the search for the minimum of a 
given function fix). There one considers a small change of x and forms 

df(x ) = f'(x)dx. 


If f'(x ) y 0, f{x) can increase or decrease for small changes of x, and thus, there 
is no minimum at the point x. A necessary condition for a minimum is therefore 
f'(x) = 0. This condition is not sufficient; it is also fulfilled for a maximum. 

In the problem (18.28), we do not have to change a variable but a function y(x). 
We replace y(x) by a “neighboring” function yo(x) + sr](x) of the desired func¬ 
tion yo, where we will afterward assume the number s is arbitrarily small. We must 
have r)(x\) = r](x 2 ) = 0. / is then replaced by y' 0 + srj', and instead of the integrand 
y /2 we obtain the Taylor series expansion into powers of s: 

A + (Jo + £ V> 2 = a/ 1 + Jo 2 e ~r ^ ^ • •)’ 

y+y 


where the term indicated by £ 2 (...) can be neglected for sufficiently small \e\. There¬ 
fore, we have 

X2 X2 X2 

/(e) = J yi + (yQ + STf') 2 dx « J + 3?' 2 dx + s J - = = r)' dx, 

x\ xi xi y ^ -fo 

which shall take a minimum for s = 0. If the integral in the second term does not 
vanish, the integral 

X2 

J y 1 +Jo 2 dx 

XI 

can increase or decrease by changing the function yoOO, depending on the sign of e. 
Hence, yo(x) does not provide a minimum of this integral. For a minimum rather 
exists the necessary condition 


/ 


f dx = 0 


(18.29) 


for any function r](x) that vanishes at x\ and JC2- To be able to exploit the far-reaching 
arbitrariness of the function r](x), we transform (18.29) by integration by parts: 


y+y 


*2 * 2 


XI 


f 


y'o 


dx 


XI 




:dx— 0. 


Because r](x i) = rj(x 2 ) = 0, the first term drops. The second term 


/ 


rj • 


X\ 


d y'o 


dx 


Example 18.4 


(18.30) 
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Example 18.4 


then and only then becomes zero for all allowed functions r]{x) if everywhere between 
X] and X 2 we have 


d Vn 

- . 0 =0. (18.31) 


If this equation were not satisfied everywhere, we could choose r)(x) so that it is 
always positive where 

<> y'o 

is positive, and choose it as negative where this expression is negative, and in this 
way establish a contradiction. We can also conclude this way: If (18.31) were not 
fulfilled anywhere, one should set r]{x) equal to zero everywhere, except for a certain 
interval about this place. But then the integral (18.30) does not vanish. We could not 
choose the quantity r]' in (1 8.29) in this way; thus we could not draw the corresponding 
conclusion for (18.29). From (18.31) now follows y' 0 = constant or >’q = 0; that means 
the former description (18.26). Thus, our calculation has replaced the requirement 
that a definite integral be minimized by a function, by a differential equation for this 
function. 

Equation (18.31) allows yet another interpretation. We have 

d / y" 

dx yi + y 2 y/i + y 2 ) 3 

As is shown in the theory of curves, this is an expression for the curvature of a curve. 
Equation (18.31) thus states that the desired curve everywhere has the curvature 0. 

We just have treated a simple problem of the “variational calculus.” Problems of 
the type (18.27) or (18.28) are called variational problems. In Exercises 18.5 and 18.6 
we shall meet further, less trivial variational problems. 


18.2 General Discussion of Variational Principles 

Given the integrable function F = F(y(x), y'(x)), we look for a function y = y{x), 
so that the integral 

*2 

I = J F(y{x),y\x))dx 

X\ 

takes an extremum value. 

This problem is transformed into an elementary extremum value problem by cov¬ 
ering the ensemble of all physically meaningful paths by a parametric representation: 

y(x,s) = yo(x)+sr](x), 

where £ means a parameter that is constant for every path, r)(x) is an arbitrary differ¬ 
entiable function that vanishes at the endpoints: 

r]{x\) = r](x 2 ) = 0. 

The desired curve is given by yoOO = y(x, 0). 
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Fig. 18.2. Possible paths from 

to (x 2t y 2 ) 


The condition for an extremum value of the integral I is then 
dl 


ds 


= 0 . 


£=0 


The differentiation under the integral symbol (allowed if F is continuously differen¬ 
tiable with respect to s) yields 


dl 


x 2 

/( 


dF dy dF dy' 


dy ds dy' ds 


-—=11 -1-- — \dx = / — T) H- 7 T) I dx. 

dg I ' ao ac / I \ a-»> a-\»^ / 


x 2 

/( 


dF 


dF 


dy dy / 


The second integrand can be integrated by parts: 


x 2 

r dF 

J ay 


-- dr/ , 

- dx = 

dx 


dF 


XI 


_dy 


~ f ri 


x 2 


XI 


x 2 

/( 


d HF\ 

Tx V ln 


Since the endpoints shall be fixed, the term integrated out vanishes, and the extremum 
condition reads 


x 2 

/( 


dF d dF\ i 

-1 ri dx = 0. 

dy dx dy' ' 


x i 


Since rj(x) can be an arbitrary function, this equation is generally satisfied only then 
if 


d dF(y(x),y'(x)) dF(y(x),y'(x)) 


dx 


dy' 


dy 


= 0 . 


(18.32) 


This relation (18.32) is called the Euler-Lagrange equation. It is a necessary condition 
for an extremum value of the integral I. The solution of the Euler-Lagrange equation, 
a differential equation of second order, together with the boundary conditions yields 
the wanted path. To simplify notation, we define the variation of a function y(x,s) as 
the difference between y(x,s) and y(x, 0) 


Sy = y(x,e) -y(x,0) = f 
ds 


£= 0 


for very small s. Then the variational problem can be formulated as 


x 2 

8 J F(y(x),y'(x))dx = 0. 


F can also include constraints by means of Lagrange multipliers (compare Chap. 16). 
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EXERCISE 

18.5 Catenary 

Problem. This is an example with a constraint. A chain of constant density a (mass 
per unit length: a = dm Ids) and length / hangs in the gravitational field between two 
points P\ (. x \, y\) and Pi(x 2 , yi)- We look for the form of the curve, assuming that the 
potential energy of the chain takes a minimum. 


Fig. 18.3. A chain hangs in y 
the gravitational field 



Solution. The potential energy of a chain element is 


dV = gay ds. 


The total potential energy is then 

*2 



where the line element is given by 




dx 


The constraint of given length l is represented by 



With the Lagrange multiplier X, the variational problem reads 



Since 81 = 0, we can introduce the function 



in the Euler equation (18.32), where we chose /jl = X/ga. From 


dF d dF 
3y dx dy' 
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it follows that 

(y - v)y" - / 2 -1 = o. 


We rewrite the last equation. With 

n dy' dy' dy f dy r 

^ dx dy dx ^ dy' 

we obtain 

, , fdy' ,2 . , dy y'dy' 

(y-v)y~r=y +i, 

dy 


y - fx 1 + y' 2 ' 


Integration yields 


ln(y - fj,) + In Ci = - ln(l + y' 2 ) 


or 


ci(y-M)=yi+y /2 . 

From this, we get 
dy 


/ 


yjcliy-ti) 2 


= = / Jjf. 

1 J 


To integrate the left side, we substitute coshv = C\(y — v), since cosh 2 v — 1 
sinh 2 v. Then 

dy = — sinh vJv, 

Ci 

and therefore, 


*/-/ 


dx. 


Integration yields 
v = C\(x + C 2 ) 


or 

1 

y = — cosh(Ci (x + C 2 )) + p,. 

Ci 

Thus, the solution is the catenary. The constants give the coordinates of the lowest 
point (jco, yo) = (—C 2 , (1/Ci) + /jl). They are determined by the given length / of the 
chain and by the suspension points Pi and P 2 . 
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EXERCISE 


18.6 Brachistochrone: Construction of an Emergency Chute 


Problem. On board an aircraft, a fire breaks out after landing. The passengers must 
leave by an emergency chute on which they glide down without friction. Determine 
by variational calculus the form of the chute with the aim to evacuate the plane as fast 
as possible (height of the hatch yo; distance to the bottom vo). Find the time of gliding 
as compared to the harsh free fall, assuming vo = (tv /2)yo. 

Hint: Use the substitution 


, dy © 

y = -p = -«*—! 

ax 2 


Remark: This problem is known as the “brachistochrone.” 


Fig. 18.4. A passenger glides 
down a chute 



Fig. 18.5. Illustration of vari¬ 
ous chutes 



Solution. The problem goes back to the Bernoulli brothers (brachistochrone, 1696). 
Energy conservation yields 


mgyo 

g(yo-y) 


1 

2 

1 

2 


mv 2 + mgy , 



dy 

dt 


21 


m 2 


0 dx ) 2 + {dy) 2 


2g{yo - >’) 
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The total time T is then 

t x 0 

T = 

0 0 


h'-ir- 


+ ( dy/dx ) 2 

2 g(yo - y ) 


dx. 


(18.33) 


To get the minimum time, one has to solve a variational problem of the form 


x 2 


S I F(x, y, y')dx = 0, 


XI 

Because 


y(xi) = yo, 
y(x 2 ) = 0 . 


„ f/dF „ 9F A , /Y9F d 9F\ 

0 = JW y+ sy Sy r = j{^-^sy) 


8ydx , 


XI XI 

the Euler-Lagrange equation reads 
d 3E 3E 


dx 3 y' dy 


= 0 


(18.34) 


or 


;F + y' 


dy' 2 " ' 3y3y' 


E + 


9 2 9F 

-- 

3v3y 3y 


(18.35) 


If the functional F is independent of x, (18.35) can be directly integrated. One finds 


d 

dx 




d 3 F 


f d F dF 


-by- 

dx dy' dy' dx 


,3 F 

dy' 


d dF ( . 

= y T,hy + Vv~ r 
= JjL?L-°J.) = 0; 

\dx 3y' 3y ) 


dF 

dy 


,d_F_ 

dy 


dF 


hence, 


dy' 

In our case, (18.33) is 


3 /- F = constant = -. 

3y' c 


F = , 


1 + y 


a 


2g(yo - y) 


Then (18.36) reads 

v , = i _y_ _ yi +/ 2 = i 

V^giyo - y) yf\ + y 2 V2^(yo - y) c’ 

l _ l 

2g(3'0-3 ; )(l+y 2 ) c 2 ' 


(18.36) 


Exercise 18.6 


(18.37) 
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Exercise 18.6 


Fig. 18.6. Possible 
solution 


The transformation y' = —cot 2 (0 /2) yields 

C 2 /2 , 2 /©\ 1 

— IT y — IT cot ( — ) — —~-, 

\2 / sin 2 (©/2) 


2g(jo - y ) 
and thus, 


y = ^0 - ^-(1 - COS©). 
4 g 


By integration, one finds an equation for x(0), namely, 
© dy c 2 . /0\ f&\d& 

■ CO, 2 = * = “^ 5m l2j COS l2)* 


= / <,t ' = C/ sin2 ( 




c 2 
2 g 


-©'- - sin©' 
2 2 


x = —(0 — sin 0), 

4 g 


y = yo-7~ (l-cos©). 
4 g 


This is just the parametric representation of a cycloid. 

The maximum value of 0 is determined by vo and yo, namely, 

xo 0o — sin 0o 
yo 1 — cos 0o 


(18.38) 


(18.39) 


The transcendental equation (18.39) can be solved in general only numerically. Special 
cases: 


© 

o 

II 

O 

TV 

2tt 

xo/yo = 0 

Tt / 2 

00 


types of i 




y o- 

i ■ y °" 

y • y °' 



\ ^ n 

\ n 

\ ^ 


\ x o<jy o 

x o = y 

\x 0 >jy 0 


Calculation of the gliding time according to (18.33) and (18.37): 


t =)\ 


I i+y 2 

2g (yo -y)' 


-dx = 


-I 


' (dx/dy) 2 + 1 

2g(yo - y) 


dy 


yo 
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y o 


2g(yo ~ y)(c 2 - 2g(y 0 - y)) 


dy 


= —2 arctan, 
1 


I C 2 - 2g(yo - y) 


2g(;yo - >’) 


JO 


C I TV 

1-arctan, 

g \ 2 1 


! ~ 2gyo \ 

2gyo ) ’ 


T = -arccot 1 
8 


I c 2 - 2gyo 


2gyo 

The integral can be found in tables. 


*0 = yw =>• ©0 = 7T =>• C = V2gyo 

For comparison, the time of free fall is 




Exercise 18.6 


As is seen already from (18.25), according to the Hamilton principle the time is not 
being varied. The system passes a trace point and the appropriate varied trace point at 
the same time. Hence, 

8t = 0. 

Starting from the integral 


81 = 8 


*2 

J L(q a (t),q a (t),t)dt = 0, 


t\ 


« = 1,2 


(18.40) 


where / is the number of degrees of freedom, we perform the variation according to 
the procedure described above and show that the Lagrange equations can be derived 
from the Hamilton principle. We describe the variation of a path curve q a (t ) by 
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where the 8 q a vanish at the endpoints, 
Sq a (t\) = 8 q a (t 2 )= 0 . 

Since time is not being varied, we have 

h t2 t2 


8 [ Ldt= [ 8Ldt= [ y -8 q a + -8 qa\dt. 

J J J V „ 9<?a „ dga ) 

h h h 01 “ 

Because 

d d 

tMo = -T(gait, s ) - q a (t, 0)) 
at at 

= -j ~{g a (t, £)) - 4:(gait, 0)) 

dt dt 

d 

= 8—q a (t) = 8q a (t), 
dt 

integration by parts of the second summand yields 

t2 t 2 

r dL p . T r dL d o T 

/ 1 —8q 0l dt= / -——8q a dt 
J 8q a J dq a dt 

h h 

' dL l t2 } / d dL \ 

= ~ / -T-XT- )Sq a dt. 

L dqa iu J \dtdq a J 


(18.41) 


(18.42) 


(18.43) 


Since 8 q a vanishes at the endpoints (integration limits), we get for the variation of the 
integral 


81 = 



d dL 
dt dq a 


8 q a ) dt = 0. 


(18.44) 


For holonomic constraints, we imagine that the dependent degrees of freedom were 
eliminated. We take the q a as the independent coordinates. Hence, the 8 q a are in¬ 
dependent of each other, and the integral vanishes only if the coefficient of any 8 q a 
vanishes. This means that the Lagrange equations hold: 


d dL dL 
dt dq a dq a 


(18.45) 


Likewise, one can obtain the Hamilton equations by replacing L by Pa^a ~ H 
and considering the variations 8 p a and 8 q a as independent. This will be worked out 
in the Exercise 18.7. 

In order to show the equivalence of the Hamilton principle with the formulations of 
mechanics studied so far, we shall demonstrate its derivation from Newton’s equations. 
We consider a particle in Cartesian coordinates. It moves along a certain path r = r (t) 
between the positions r(t\) and rfe). Now the path is varied by a virtual displacement 
<5r that is compatible with the constraint: 


r (t) -> r(f) + Sr(f), 


8 r(t\) = 8 r(t 2 ) = 0. 
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The time is not varied. The work needed for the virtual displacement is 


8A = F-8r = F a -Sr, 


ifF e is the external force and the constraint reaction does not perform work. If is 
conservative, then 

F e -8r=-8V, 

and according to Newton 


—8V = mr • 8r. 


The right-hand side can be transformed (the operator (d/dt)8r = 8r is treated accord¬ 
ing to (18.42)): 

d d (\ 9 \ 

— (r-<$r) = r- — 8r + r • 8r = r • 8r + r • 8r = 5 -r 2 ) + r-<$r. 
dt dt \2 ) 


Multiplication by the mass m yields 


mr • Sr = m — (r • 8r) — 8 ( -mr 


1 


•2 


and therefore, 

d . 

8(T — V) = 8L = m — (r • 5r). 

dt 

Integration with respect to time leads to 


s f 


Ldt = m [r • <5r]^ = 0. 


Thus, the Hamilton principle for a single particle has been derived from Newton’s 
equations. The result can be directly extended to particle systems. This can be un¬ 
derstood quite generally in the following way: If a particle system obeys the La¬ 
grange equations (18.45) (which are equivalent to Newtonian mechanics), then we 
have (18.44) and from that—because of (18.43) —again (18.41) or (18.40), provided 
that 8 q a (t\) = 8 q a (t 2 ) = 0. Thus, the Lagrange equations are equivalent to the Hamil¬ 
ton principle. 


EXERCISE 

18.7 Derivation of the Hamiltonian Equations 

Problem. Derive the Hamilton equations from the Hamilton principle. 
Solution. The Hamilton principle reads 

t2 

8 J Ldt = 0, 

h 


(18.46) 
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Exercise 18.7 


where the Lagrangian L is now expressed by the Hamiltonian H ; hence, 


L = ^2,p a q a ~ H(p a ,q a ,t). 

a 


(18.47) 


Then (18.46) becomes 

*2 *2 


/ SLdt = J 


o . o- dH O 9# 

°PaQa 4” Pa°Qa ~~ 7 vPa — 7 

dp a dq a 


dt. 


(18.48) 


The second term on the right-hand side can be transformed by integration by parts, 

*2 t 2 *2 

J p a 8q a dt = J p a ^-8q a dt = paSq a H\^ - J p a 8q a dt. (18.49) 

t\ t\ t\ 


The first term vanishes since the variations at the endpoints vanish: 8 q a (ti) = 
8 qa{h) = 0. Hence, (18.48) becomes 


0 = 


h 

J 8 Ldt = 


t\ 



dt. 


(18.50) 


The variations 8 p a and 8 q a are independent of each other because along a path in 
phase space the neighboring paths can have different coordinates or (and) different 
momenta. Thus, (18.50) leads to 




Pa = 


dH 
dpa ’ 
dH 
dqa 


(18.51) 


which was to be demonstrated. 


18.3 Phase Space and Liouville's Theorem 

In the Hamiltonian formalism, the state of motion of a mechanical system with / de¬ 
grees of freedom at a definite time t is completely characterized by the specification 
of the / generalized coordinates and / momenta q\, ...,qf\ p \,..., p/. These qt 
and pi can be understood as coordinates of a 2/-dimensional Cartesian space, the 
phase space. The /-dimensional subspace of the coordinates qt is the configuration 
space ; the /-dimensional subspace of the momenta pi is called momentum space. 
In the course of motion of the system the representative point describes a curve, the 
phase trajectory. If the Hamiltonian is known, then the entire phase trajectory can 
be uniquely calculated in advance from the coordinates of one point. Therefore to 
each point belongs only one trajectory, and two different trajectories cannot intersect 
each other. A path in phase space is given in parametric representation by qjfit), pk(t) 
(k =\,...,/). Because of the uniqueness of the solutions of the Hamilton equations, 
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the system develops from various boundary conditions along various trajectories. For 
conservative systems the point is bound to a (2/ — 1)-dimensional hypersurface of the 
phase space by the condition H(q, p) = E = constant. 

EXAMPLE 


18.8 Phase Diagram of a Plane Pendulum 

If the angle <p is taken as a generalized coordinate, then we have for the plane pendu¬ 
lum (mass m, length l) 

Pep = ml 2 if). 

The Hamiltonian, which represents the total energy, reads 

1 P 2 

H = -m(l(p) 2 — mgl cos ip = —^ — mgl cos p = E. 

2 2 ml 1 

The origin of the potential was put at the suspension point of the pendulum. One then 
gets the equation for the phase trajectory p<p = p<p(<p): 

Pep = ±^2ml 2 (E -\-mgl cos ip). 

Thus, we obtain a set of curves with the energy E as a parameter. 

For energies E < mgl , the phase trajectories are closed (ellipse-like) curves; the 
pendulum oscillates forth and back (vibration). If the total energy E exceeds the value 
mgl , the pendulum still has kinetic energy at the highest point <p = ±n and continues 
its motion without reversal of direction (rotation). 



Fig. 18.7. Phase space and 
phase diagram of the one¬ 
dimensional pendulum 


We now consider a large number N of independent points that are mechanically iden¬ 
tical, apart from the initial conditions, and are therefore described by the same Hamil¬ 
tonian. As a specific example, we can imagine particles in the beam of an accelerator. 
If all points at time t\ are distributed over a 2/-dimensional phase space region G\ 
with the volume 

AV = Aq\ • • • Aqp • Ap\ • • • A pf, 

one can define the density 
AN 
e= AV 

With the course of motion, G\ transforms according to the Hamilton equations into 
the region G 2 . 
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Fig. 18.8. Evolution of a re¬ 
gion in phase space 



The statement of the Liouville theoremi 3 is 

The volume of an arbitrary region of phase space is conserved if the points of its 
boundary move according to the canonical equations. 

Or, in other words, performing a limit transition: 

The density of points in phase space in the vicinity of a point moving with the fluid 
is constant. 


To prove that, we investigate the motion of system points through a volume element 
of the phase space. Let us first consider the components of the particle flux along the 
qk~ and /^-direction. 

The area ABCD represents the projection of the 2/-dimensional volume element 
dV onto the qk, Pk -plane. 


Fig. 18.9. Projection of the 
volume element onto the 
q k , Pk -plane 


Pk 


D 


C 


dp k -► 

dq k 


A 


B 




The number of points entering the volume element per unit time through the “side 
face” (with the projection AD onto the qk , Pk -plane) is 

QQk dp k ’ dVk, 

where 

/ 

dVk = Y[dqadp a 

a=\ 

a^k 

is the (2/ — 2)-dimensional remainder volume element; dpk • dVk is the magnitude of 
the lateral surface with the projection AD in the Pk,qk~ plane. 


3 Joseph Liouville , b. March 24, 1809, St. Omer-d. Sept. 8, 1882, Paris. Liouville was professor of 
mathematics and mechanics in Paris, at the Ecole Polytechnique, at the College de France, and at 
the Sorbonne. He was a member of the Bureau of Measures and of many scholarly societies. From 
1840 to 1870, he was considered the leading mathematician of France. He worked on statistical 
mechanics, boundary value problems, differential geometry, and special functions. His constructive 
proof of the existence of transcendental numbers and, in 1844, the proof that e and e 1 cannot be roots 
of a quadratic equation with rational coefficients, were of great significance. 
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The Taylor expansion for the points leaving at BC in the first direction yields 
{eqk +-^ieqk)dqk\dpk ■ dVk- (18.52) 

Analogously, for the flux in pk -direction we have 


entrance through AB : QPkdqk ■ dVk, 


exit through CD: 


(ePk + 


7— (QPk)dpk \dq k -dV k . 

dPk ) 


(18.53) 


From the flux components in pk~ and ^-direction, the number of system points per 
unit time 


( 9 9 

— (Qqk) + — (QPk) \dV 
dq k dp k 


(18.54) 


gets stuck in the volume element. 

By summing over all k = 1one obtains the number of points that get stuck 
in total. This quantity just corresponds to the change with time (time derivative) of the 
density multiplied by dV. Hence, we can conclude 



(18.55) 


We are dealing here with a continuity equation of the form 
dp 

div(gr) + — = 0. 

Ot 

The divergence refers to the 2/-dimensional phase space: 


/ 


d 


v = y — 


/ 

+E 


k =1 


d 

dpk 


Continuity equations of this type often appear in flow physics (hydrodynamics, 
electrodynamics, quantum mechanics). They always express a conservation law. 
Application of the product rule in (18.55) yields 



dQ_ 

dqk 


dqk dp . 

qk + Q^- + ^-Pk + Q 
dqk dp k 


dpk \ 

dpk) 


dg 

+ — = 0 . 
dt 


(18.56) 


From the Hamilton equations, we have 

dq k d 2 H dp k d 2 H 

-=- and -=-. 

dqk dqudpk dpk dqkdpk 

If the second partial derivatives of H are continuous, then 
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and from this, it follows that 


/ 


dg dg . \ o/j 


dQ 


(18.57) 


This just equals the total derivative of the density with respect to time, 


d 

j, Q=a ■ 


(18.58) 


and hence, £ = constant. 


EXAMPLE 


18.9 Phase-Space Density for Particles in the Gravitational Field 

The system consists of particles of mass m in a constant gravitational field. For the 
energy, we have 

p 2 

H = E = - - mgq. 

2m 

The total energy of a particle remains constant. 

The phase trajectories p(q) are parabolas 

p = y/2m(E + mgq), 

with the energy as a parameter. We consider a number of particles with momenta at 
time t = 0 between the limits p\ < p < p 2 , and with energies between E\ < E < E 2 . 
They cover the area F in phase space. At a later time t the points cover the area F'. 
They then have the momentum 

p' = p + mgt, 

so that F' is the area between the parabolas limited by p\ + mgt < p' < P 2 + mgt. 
With 

(p 1 /2m) - E 

q =-. 

mg 


Fig. 18.10. On Liouville’s 
theorem: Phase space for 
particles in the gravitational 
field 
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the size of the areas is calculated as 

P2 (l/mg)(Q? 2 /2m)-Ei) p 2 

F=[dp f dq = — { dp = — — (p 2 - pi), 

J J mg J mg 

Pi ( l/mg)((p 2 /2m)—E2 ) Pi 


and likewise, 


E 2 -Ei 

F = - (P2~P\) 


mg 
E 2 ~ E\ 
mg 


(pi-pi). 


This is just the statement of Liouville’s theorem: F = F' means that the density of the 
system points in phase space remains constant. The significance of Liouville’s theorem 
lies in the field of statistical mechanics, where one considers ensembles because of 
lack of exact knowledge of the system. 

A special application is the focusing of particle currents in accelerators where a 
large number of particles are subject to identical conditions. Here a reduction of the 
beam cross section must lead to an undesirable broadening of the momentum distrib¬ 
ution. 


Example 18.9 


18.4 The Principle of Stochastic Cooling 4 

An essential implication of Liouville’s theorem is that the phase space occupied by an 
ensemble of particles in the absence of friction behaves like an incompressible fluid. 

We shall show in the following that the principle of stochastic cooling leads to a 
(seeming) contradiction to the theorem of Liouville. For this purpose it is necessary 
to expand on the method of stochastic cooling of antiprotons developed by van der 
Meer. 5 The successful application of this method led to the proof of the existence of 
the intermediate vector bosons (IVB) W + , W ~, and Z° predicted by the theory of 
weak interactions. 


4 This chapter was stimulated by a lecture given by Professor Herminghaus (Mainz), at the occa¬ 
sion of the sixtieth birthday of Professor P. Junior 1988 in Frankfurt. My thanks go to colleague 
Mr. Herminghaus for leaving his manuscript, which I found very useful when writing this section. 

5 Simon van der Meer, b. Nov. 24, 1925, Den Haag. He received the Nobel prize for physics in 
1984. He studied mechanical and electrical engineering at the Technical University of Delft, took his 
diploma exams as engineer and worked at first in the Philips central laboratory in Eindhoven. In 1956 
he got a position as a development engineer at CERN in Geneva. Here he soon earned a reputation 
for professional competence, imagination, and also for his talent for theory. He was appointed a 
“senior engineer.” Meanwhile the Italian physicist Carlo Rubbia, a scientific coworker at CERN, had 
developed the idea to shoot 450 GeV protons from the just-finished super-high energy accelerator 
“SPS” onto their artificially produced “antiparticles”—antiprotons. The project was realized as a 
collider system. For the first time one could generate and demonstrate the so far only hypothetical 
intermediate W- and Z-bosons. Van der Meer, a “genuine puzzler,” provided a genial invention: the 
stochastic cooling, which allowed researchers to collect antiprotons in sufficient quantity and to store 
them for the experiments. Only one year after their great success, which proved the predictions of 
theory in a brilliant way, van der Meer and Rubbia were awarded with the Nobel prize for physics 
“for decisive merits in the discovery of the field quanta of weak interaction.” 
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Fig. 18.11. Schematic 

representation of a proton- 
antiproton collision. In the 
collision quark-antiquark 
pairs are created whose reac¬ 
tions can lead to the creation 
of intermediate vector bosons 
(• = quark, o == antiquark) 


proton 



antiproton 




According to the predictions of the theory, these particles should be able to decay 
as follows: 


IVB —> lepton + antilepton, (18.59) 

IVB —> quark + antiquark. (18.60) 

For the experimental proof of the IVB, one utilized the inverse reaction (18.60), by 
shooting high-energy beams of antiprotons onto protons in the proton synchrotron 
(PS) of the CERN. Since the protons consist only of three quarks ( q ) and the antipro¬ 
tons of three antiquarks (< q ), many quark-antiquark pairs are created by the violent 
collisions. The reactions between these quarks and antiquarks can generate the inter¬ 
mediate vector bosons (see Fig. 18.11). In order to reach a high event rate, which is 
calculated according to 

event rate = cross section • luminosity, (18.61) 


one needs both a large cross section and a high beam luminosity. Now one has 


luminosity 


Np-Np 

q 


(18.62) 


^ IVB 


Fig. 18.12. The existence of 
intermediate vector bosons 
could be proved for the first 
time at CERN by the collision 
of intense high-energy proton 
and antiproton beams (N-p = 
number of antiprotons in the 
beam, N p = number of pro¬ 
tons) 



Here, N p and N-p denote the number of protons ( p ) and antiprotons (p ) in the beam, 
and q represents the beam cross section. The higher the number of particles and the 
lower the beam cross section, the higher is the event rate for creating an intermediate 
vector boson. See also Fig. 18.12. 

An efficient cooling mechanism for the antiproton beams is therefore needed. Each 
particle of the beam moves by the action of magnetic fields in horizontal and vertical 
vibrations about a closed pre-set trajectory. In this context the term cooling means 
a reduction of the vibration amplitudes of the particles and thus of the beam cross 
section, or a reduction of the width of the momentum distribution of the particles 
about the mean value. This is illustrated by Fig. 18.13. Already well-tried cooling 
methods are electron cooling, cooling by synchrotron radiation, and the stochastic 
cooling , which will now be outlined in more detail. 

The motion of each particle in the beam is described by a point in a 6-dimensional 
phase space spanned by the 3 spatial and the 3 momentum coordinates. This phase 
space point is surrounded by empty space. By an appropriate deformation of the phase 
space element the particle can be shifted toward the center of gravity of the distribu¬ 
tion. This is the principle of stochastic cooling. 
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Fig. 18.13. A beam before 
and after cooling, (a) in po¬ 
sition space, (b) in momen¬ 
tum space. Part (b) essentially 
shows the particle density ver¬ 
sus the transverse momentum 


Fig. 18.14. The cooling sys¬ 
tem consisting of “pick-up,” 
amplifier, and “kicker” 

kicker 


The experimental setup for cooling of antiproton beams is sketched in Fig. 18.14. 

In the ideal case, a probe (pick-up) measures the position or the momentum of a 
particle. This tiny signal is amplified and fed to the “kicker,” which then corrects the 
transverse or the longitudinal momentum and thereby cools. Thus, the cooling can 
be interpreted as a one-particle effect, since each particle cools itself by emission of a 
self-generated signal (coherent effect). An essential prerequisite is that the particle and 
the signal reach the kicker simultaneously. Because of the finite resolving power of the 
probe in the real case, besides the desired signal the perturbing signals from other par¬ 
ticles reach the kicker too. This noise causes a heating of the particles (incoherent 
effect) and thus counteracts the cooling effect. This interplay of cooling and heat¬ 
ing mechanisms is illustrated by Fig. 18.15 and will be discussed in Exercise 18.10. 
The cooling effect is directly proportional to the signal amplification, while the heat¬ 
ing is proportional to the square of the amplification. The particle is cooled only in 
the hatched area (see Fig. 18.15). Evidently there exists an optimum of amplification 
where the cooling effect reaches an extremum value. Thus, the greater the intensity 
of the beams, the greater is the noise and the heating effect, and the less is the factor 
of optimum amplification. Generation of an intense beam of antiprotons at CERN is 
therefore performed by stages and may last several hours. The principle is illustrated 
by Fig. 18.15. 

First, an antiproton pulse of low intensity is injected at the left border of the vacuum 
chamber (1). The corresponding momentum density distribution can be seen on the 
right. The beam and its momentum width are then compressed by cooling (2). A high- 
frequency voltage is used to shift the pulse to the right side of the chamber (3), thus 
giving space for a further antiproton pulse which is injected into the chamber (4). 
After cooling, the second pulse is shifted onto the already “deposited” pulse (5). This 
procedure is repeated every 2 to 3 seconds for several hours. In this way, the longi¬ 
tudinal phase-space density is increased by accumulation of more and more particles 
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Fig. 18.15. Sectional view of 
the vacuum chamber with 
beams in various stages of 
the accumulation process. The 
right-hand part shows the cor¬ 
responding density distribu¬ 
tion versus the momentum 


(1) 

( 2 ) 

( 3 ) 

(4) 

(5) 

( 6 ) 


( <=> ) 

( CD ) 

C CD ) 

(<=> <=> ) 

C <=0 


c 



into the same momentum interval (6). The final 6-dimensional phase-space density of 
the stack is higher than the density of a single pulse by a factor of 3 • 10 8 . The intense 
antiproton beam generated this way can now be further accelerated and brought to 
collision with a proton beam. Only one year after the demonstration of the intermedi¬ 
ate vector bosons, S. van der Meer and C. Rubbia 6 were awarded the Nobel Prize in 
physics for their achievements. 

We now come back to the apparent contradiction between Liouville’s theorem and 
the method of stochastic cooling. While according to the Liouville theorem only a sin¬ 
gle pulse can be accommodated in a ring, stochastic cooling allows one to accumulate 
about 36,000 pulses in the course of a day. The final phase-space density is higher than 
that of a single pulse by a factor of 3 • 10 8 . 

However, stochastic cooling and the Liouville theorem are dealing with different 
situations. The former presupposes an ensemble of a finite number of discrete parti¬ 
cles, while the Liouville theorem presupposes a phase-space continuum (see divv !). 
A discrete ensemble thus represents only a model approximation of this condition that 
works the better the more dense the occupation of the phase-space volume becomes. 

This becomes clear by the example of the cooling rate (which will be calculated in 
the subsequent problem): 

1 W 9 

- = —(2 g-g 2 ). (18.63) 

T N 


6 Carlo Rubbia, b. March 31, 1934, Goriza. He got his education as a physicist in Pisa at the Scuola 
Normale, a time-honored university. Here he got his doctorate in 1958, after which he worked for a 
year as a research scholar at the Columbia University in New York, and then as an assistant professor 
in Rome. In 1960, he came to CERN at Geneva as high-energy physicist. Since 1972, he has held 
a chair at Harvard University. In Geneva, Rubbia was inspired by the unified theory of weak and 
electromagnetic interactions developed by A. Salam, S. Glashow, and S. Weinberg (Nobel Prize for 
physics, 1979). In 1976, Rubbia proposed to CERN the construction of a new 450 GeV SPS accel¬ 
erator for the purpose of proton-antiproton collision experiments. The accelerator achieved collision 
energies of 540 GeV, which were sufficient to create the (so far only predicted) W- and Z-bosons. 
Important for the success of the project was not only Rubbia, but also S. van der Meer, whose con¬ 
tributions made possible the generation of sharply bunched, pulsed antiproton currents. Both of them 
got the Nobel Prize for physics in 1984. 
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N denotes the number of particles in the beam, W the bandwidth of the system and 
g a gain factor that will be defined in problem 18.10. The essential point however is 
the dependence of the cooling rate on the inverse of the particle number of the beam, 
l/N. In the limit 

lim - = 0, 

N^o o T 

cooling is no longer possible, as we would expect. 

We note that the same restriction for applying Liouville’s theorem basically also 
holds in thermodynamics, but there the approximation is better by 12 orders of mag¬ 
nitude (10 12 -> 10 24 )! 

Much more important, however, is the fact that Liouville’s theorem holds on the 
condition that the particles obey the Hamilton equations, with a given Hamiltonian H. 
In this sense the particle system must be closed. But just this condition is violated by 
the reading off the particle position (coordinate, momentum pick-up) and by the cor¬ 
responding correction (kicker; see Fig. 18.14). This is a calculated interference from 
outside which cannot be described by a Hamiltonian. Hence, the Liouville theorem 
does not have to be fulfilled; moreover, it must not hold at all! 


EXERCISE 


18.10 Cooling of a Particle Beam 


Problem. 

(a) Calculate the cooling rate per second for a beam of N particles. 

(b) When does maximum cooling occur? 

(c) Calculate the cooling time for a beam of N = 10 12 particles. Let the bandwidth of 
the system be W = 500 MHz, and g = 1. 


Solution, (a) We first consider the case that the pick-up and the kicker are so fast that 
they seize each particle independently (see Fig. 18.16). Let the displacement of this 



Fig. 18.16. In the ideal case, 
the pick-up seizes one particle 


particle from the beam axis be x&. After passing the distance 1/4 (A. is the wavelength 
of the x-vibration), the deviation is corrected electromagnetically in the kicker. Let 
the correction be 

A Xk=gXk. (18.64) 

The corrected distance x' k of the particle from the beam axis is thus given by 
x' k =x k - Ax k = (1 - g)x k (18.65) 
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Fig. 18.17. A single particle is 
seized by the pick-up and its 
momentum is corrected after a 
X /4 wavelength at the kicker. 
After one revolution, the new 
trajectory leads to the cor¬ 
rected spatial displacement x' k 


Fig. 18.18. In the real case, 
the pick-up seizes several 
(N s ) particles that cause a 
noise 
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T s 


(Fig. 18.17). For g = 1, the cooling would be ideal. However, in the real case there 
appears a noise in the pick-up which is due to further N s — 1 particles passing the 
pick-up in the time interval T s (see Fig. 18.18). Thus, the pick-up measures not only 
the spatial displacement Xk of the kth particle from the beam axis (x = 0), but also 
that of the additional N s — 1 particles located around the kth one. The recorded spatial 
displacement is therefore the mean value of all N s seized particles (the kth and the 
N s — 1 located around the kth particle): 


1 N * 


7 = 1 


(18.66) 


For clarity, we will label the kth particle in the sum on the right-hand side, e.g., the 
numbering will be chosen so that j = 1 just denotes the kth particle. Moreover, it 
should be clear that the remaining N s — 1 particles are closely located around the kth 
one when passing the pick-up. We therefore add the index k to the particular spatial 
displacement xj : 


(Xlc) = 


1 

N~s 



(18.67) 


where x\j ( = xjt and x jj : / x* for 2 <j<N s . 
The correction for the £th particles is now 


x k=Xk- g{Xk)- 


(18.68) 


This means that there will be no kick if the sample of N s particles on the average 
moves on the beam axis: 


x' k = x k if (xk) = 0. 


(18.69) 
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In other words, the kicker will not be activated if the center of gravity 


S k = 


1 

mN s 


N s l N s 

Y^ mX N k = wYl X N k = ^ k) 

7=1 S 7=1 


(18.70) 


of the sample in the pick-up is already on the beam axis (all particles have the same 
mass m). 

For real measurements in the pick-up, this will, of course, not be fulfilled in gen¬ 
eral. The probability that the center of gravity of the N s particles that are statistically 
distributed over the beam just coincides with the beam axis is extremely low. In the 
realistic case the sample will always be “kicked”. 

We now want to know how the mean value of the spatial displacement of all N 
particles in the beam will change by the mechanism of stochastic cooling. This mean 
value is 


Exercise 18.10 


E{x k ) = ^Y jXk (18.71) 

^ k= 1 

and will be denoted by E {xjf} (expectation value) to distinguish it from the mean value 
for the sample of N s particles, (jcjt), which was defined in (18.66). It is however clear 
that the mean value of the positions of all the particles just defines the beam axis. 
Since we put the beam axis at the origin of the coordinate system, v = 0, the mean 
value of the spatial displacement of all the particles from the beam axis just vanishes: 

E(x k ) = 0. (18.72) 

This always holds, independent of the mechanism of stochastic cooling. Thus, the 
mean value E(xk) is not an appropriate quantity for investigating the mechanism of 
stochastic cooling. It is evident that the mean square of the spatial displacement E(x%) 
is much better suited for this purpose. We therefore will investigate the change of 
E(x%) by the stochastic cooling mechanism. First we consider the mean square spatial 
displacement x\ for the kth particle, and to this end, we square (18.68): 

x k 2 = xl-2gx k {x k ) + g 2 {x k ) 2 . (18.73) 

The change of x\ for a single passage through the kicker is thus given by 

AC**) := x' k 2 ~x\ = -2 gx k (x k ) + g 2 {x k ) 2 . (18.74) 

Since there is one kick per revolution, this is also the change of x 2 per revolution. By 
averaging over all particles, one obtains 

E( A(xf)) = E{x' k 2 - xj) = E(x ' k 2 ) - E{x 2 k ) 

= A = -2 gE(x k (x k ))+g 2 E((x k ) 2 ). (18.75) 

The second equals sign in the first line follows from the additivity of the expectation 
value £(...); compare (18.71). 

To calculate the change of the expectation value of the mean square of the spatial 
displacement per revolution, A (E(x%)), the expectation values E(xk{xk)), E((xk) 2 ) 
must be expressed by £(x|). We then obtain A (E(x%)) as a function of E(x^), or a 
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Exercise 18.10 


differential equation for E(x%), the solution of which allows us to calculate the desired 
quantities. 

To evaluate E(xk(xk)), we write with (18.67) 



(18.76) 


In the first term, we used x\^ = Xk, and in the second one Xk=x\^- 

We now realize that two different particles in the beam cannot be correlated (the 
particles are statistically distributed over the beam!). Even though they belong to the 
same sample of N s particles around the &th particle, their spatial displacements x^k 
and Xj'k, i ^ j, on the average must satisfy 


1 N 

E(xi'kXj f k) = ~ ^ ' x i,k x j,k = for i 7 ^ j . 


(18.77) 


k= l 


Thus, if i = 1 and 2 < j < N s , then 


1 N 

E(xi t kX jf k) = ~ ^2 x i,kXj,k = 0 . 


k= l 


(18.78) 


This is now utilized in the second term of (18.76), which then vanishes. The first term 
can immediately be rewritten using the definition of E , and we obtain 


E(x k {x k )) = 2-£(xf). 

™s 

Furthermore, 


E((x k ) 2 ) = 


1 

N 


N 


£ 


i 

kc 

1 y s 


N s 

^ ' x i,k x j,k 

iJ=1 



T - 'y ' Xj,kX j,k 

ij =1 


(18.79) 


(18.80) 


The second term again vanishes by using (18.77). In the first term we first average by 
summing over all particles, 



(18.81) 


The mean square spatial deviation E(xf k ) cannot depend on the label i of the particle 
from the sample of N s particles, E(xf k ) = E(x%). The sum over i therefore yields 
only the additional factor N s , and we obtain 

E((x k ) 2 ) = 2 - E(xl). 

tN s 


(18.82) 
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Equation (18.75) with (18.79) and (18.82) thus turns into 


A(E(x 2 k )) = -^ 1 ^-E(x 2 k ) 


(18.83) 


for the change of the mean square spatial displacement per revolution. The “differen¬ 
tial” change dE(x%) per “differential” revolution dn is 

dE(x k ) _ 2 g-g 2 


-E(xl). 

dn N s k 

This differential equation is solved by the function 

. 2 g - g 2 " 

N x 


E(xf) = C exp I —n 


For the root of the mean square (rms) spatial displacement 


•^rms •— Y EipCjf), 

we obtain 


-^rms — 


V^Cexp ^—n 


2 g~g z 
2 N x 


irms decreases to the eth fraction of its original value after 


2 N s 


n o = 


revolutions. Since each revolution takes the time T, it thus lasts for 
2 N S T 


x = noT = 


2 g~g 2 


(18.84) 


(18.85) 


(18.86) 


(18.87) 


(18.88) 


to reduce x rms to the fraction 1 /e of its original value. Since among N particles orbit¬ 
ing in the time T, N s particles are seized in the time T s , for a homogeneous particle 
flux density we have 


N S = N — , 

s T 

and therefore, 
2 NT S 

T = 




2* 


(18.89) 


(18.90) 


Exercise 18.10 


With the bandwidth W = 1/2Ty (Nyquist’s theorem) or 
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Fig. 18.19. 



we finally obtain the cooling rate 
1 W 9 

- = —(2 g-g 2 ). (18.92) 

r N 


(b) From the discussion of (18.92), it follows immediately that the cooling rate is 
maximized for g = 1. For g > 2, the particles are heated up. 

(c) With the numerical values given in the problem, for the cooling rate we get 


1 500 MHz , 1 

- =- -1=5-10“ 4 - 

r 10 12 s 


(18.93) 


and thus, 


r = 2 • 10 3 s 



(18.94) 
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Given a Hamiltonian H = H(qj , pj , t), the motion of the system is found by integra¬ 
tion of the Hamilton equations: 

dH . dH 

Pi = ~— and q t = — . (19.1) 

Hi Hi 

For the case of a cyclic coordinate, we have, as we know, 

dH „ . . ^ 

— = 0, i.e., pi = 0. 
dqt 

Hence, the corresponding momentum is constant: pi = Pi = constant. 

Whether or not H contains cyclic coordinates depends in general on the coordi¬ 
nates adopted for describing a problem. This is immediately seen from the following 
example: If a circular motion in a central field is described in Cartesian coordinates, 
there is no cyclic coordinate. If we use polar coordinates (q, <p), the angular coordinate 
is cyclic (angular momentum conservation). 

A mechanical problem would therefore be greatly simplified if one could find a 
coordinate transformation from the set pi , q t to a new set of coordinates P t , Q\ with 


Qi — Qi(P j > Qj > 0 > Pi — Pi iP j 5 Qj ’ 0 > 


(19.2) 


where all coordinates Q t for the problem were cyclic. Then all momenta are constant, 
Pi = Pi , and the new Hamiltonian H f is then only a function of the constant momenta 
Pi ; hence, H' = H'(Pj). Then 


Qi = 


dH'(Pj) 

dPi 


= coi = constant, 


Pi = ~ 


dH'(Pj) 

dQi 


= 0 . 


Then integration with respect to time leads to 


Qi =<Dit + coo, Pi = Pi = constant. 


Here, we presupposed that the new coordinates (Pi , Qi) again satisfy the (canoni¬ 
cal) Hamilton equations, with a new Hamiltonian H'(Pj, Qj, t). This is an essential 
requirement for a coordinate transformation of the form (19.2) to make it canonical. 

Just as pi is the canonical momentum corresponding to qi (pi = dL/dqQ , Pi shall 
be the canonical momentum to Qi . A pair (qi,pQ is called canonically conjugate 
if the Hamilton equations hold for qi and pi . The transformation from one pair of 
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canonically conjugate coordinates to another pair is called a canonical transformation. 
Then 


Qi = 


dH' 

9^’ 


Pi = ~ 


dH' 

Wi' 


(19.3) 


At the moment, we do not yet require that all Qi be cyclic. This case will be con¬ 
sidered later (Chap. 20). 

In the new coordinates, we require Hamilton’s principle to be maintained. Thus, 
for fixed instants of time, t\ and t 2 , we have both 


8 


ti 

J L(qj, qj, t)dt = 0 


h 


and 


*2 

5 J L'(Qj, Qj, t)dt = 0. 

Thus, the difference 


8 



L r )dt = 0 


(19.4) 


also vanishes. 

We observe that (19.4) will then be fulfilled even if the old and new Lagrangians 
differ by a total time derivative of a function F: 


L 



because 



n,) = o, 


since the variation of a constant equals zero. As we shall see, the function F medi¬ 
ates the transformation (pi,qQ to (Pi, Qi). F is therefore also called a generating 
function. In the general case, F will be a function of the old and the new coordinates; 
together with the time t it involves An + 1 coordinates: 


F — F(pj, qj, Pj, Qj, t). 

But since simultaneously there are 2 n transformation equations 


Qi — Qi(P j > Qj > 0 > Pi — Pi(P j > Qj > 0 > (19.5) 

F involves only 2^ + 1 independent variables. F must contain both a coordinate from 
the old coordinate set pi (or qQ and one of the new Pi (or QQ to enable us to establish 
a relation between the systems. Hence, there are four possibilities for a generating 
function : 



F\ — F(@j, Qj,t), 
F 3 = F(pj , Qj,t), 


F 2 = F(gj,Pj,t ), 
F 4 = F(pj , P/,0. 
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(19.6) 


Each of these functions has 2^ + 1 independent variables. The dependency must be 
selected in a suitable way, according to the actual problem. We now derive the trans¬ 
formation rules of the form (19.2) from a generating function of type F \. 


Because 



L = L' 4—— and 


09.7) 


dt 


we have 



(19.8) 


For the total time derivative of F\ we then have 



(19.9) 


We insert this expression into (19.8), which yields 



By comparing the coefficients, we obtain 


dFi(qj, Qj,t) 


dqi 

dFiiqj^ Qj,t) 


(19.10) 



We are now prepared to derive the transformation equations for a generating func¬ 
tion of the type F 2 , which is also denoted by S : 

F 2 = S = S(qj, Pj,t). 

For the derivation, we will use a comparison of coefficients as for F \; therefore, we 
require that F 2 be composed as follows: 



(19.11) 


since then we can consider the problem analogously to F\. We imagine the Qi as 
being expressed through the second equation of (19.10), i.e., through 
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According to (19.8), we have 


E Pm ~H = J 2 p i Qi ~ H ' + J t Fx 


= Y PiQi-H'+- ( F 2 (qj p i Qi )• 


This leads to 


E PM -Y,PiQi-H + H' = £. ^F 2 (qj, Pj P i Qi 


V-V 3^2 . 3^2 5 3^2 

= E^+E^ + ir 
-Y. p iQi-Y. p iQi 


v-^ . v-^ . . v-^ a^ 2 . v-^ a^ 2 . a/^ 

EM i + Ew.-*+»' = E ¥ «+E^+ ¥ - 


Comparing again the coefficients now yields the equations 


Pi = 


dF 2 (gj, Pj,t) 
Hi 


Qi = 


dF 2 (qj, Pj,t) 


dPi 


H'(P j ,Q j ,t) = H(p j ,q j ,t) + 


dF 2 (qj , Pj,t) 
dt 


(19.12) 


The first two relations allow us to determine the transformation equations qi = 
qi(Qj , Pj,t) and pi = Pi(Qj , Pj,t), which by insertion into the third equation 
of (19.12) yield the new Hamiltonian H'(Pj , Qj, t). 

The transformation equations for the other types of generating functions are ob¬ 
tained analogously, by choosing an appropriate sum which enables us to use the meth¬ 
ods of the first two problems. 

From (19.10) and (19.12), we obtain the dependence of the new coordinates 
(Pi, Qi) on the old (pi , qi) and vice versa. For the case F\, from 


Pi = 


d F\ (qj , Qj,t) 
dqi 


follow the equations pi = pi (qj, Qj,t), which can be solved for the Qi : 


Qi = Qi(Pj,qj,t). 
Insertion into the equations 

dFliqj, QpQ 
1 dQi 

then enables us to calculate 
Pi = Pi(Pj,qj,t). 
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We now understand the name generating function for F \ The function F determines 
the canonical transformation 


Qi — QiiPj > Qj > 0 » Pi — PiiPj > Qj » 0 


through equations of the type (19.10) or (19.12). 

By means of Legendre transformations, we may furthermore define generating 
functions F$(pj, Qj,t) and F^(pj, Pj,t ). Based on the Legendre transformation 

^3 (Pj i Qj,t) = Fiiqj, Qj,t ) - y] qtPi (19.13) 


we obtain with a similar derivation the canonical transformation rules 
dF 3 (pj,Qj,t) dF 3 (pj,Qj,t) 


<?; = 


H' = H + 


dpi 

<FFj(Pj^ QpJ) 

dt 


Pi = 


dQi 


Starting finally from 


F\(p j , Pj ,t) = F 3 (pj, Qj,t) + ^2 Qi Pi 

i 


the following transformation rules emerge 


Qi = ~ 


dF4(pj, Pj,t) 
dPi 


Qi = 


dF^jpp Pj,t) 

3 Pi 


(19.14) 


H '_ H i W^Pj’Pj’t) 
dt 


Calculating the second derivatives of the generating functions ^1,2,3,4 , we find the fol¬ 
lowing relations to apply between old and new coordinates under a canonical trans¬ 
formation 


dQi _ d 2 F 2 _ dpk dQi _ d 2 F 4 _ Bq k 

3 Pi _ 3 2 Fi _ 3 Pk 3 Pi _ 3 2 _F3 _ dq k 

dqk dqkdQi dQi' dpk dpkdQt dQi 

Exactly the existence of these mutual relations between old and new coordinates dis¬ 
tinguishes a canonical transformation from a general transformation (19.2) of the sys¬ 
tem’s coordinates. For the latter, (19.15) do not hold. 

In the preceding derivation, the Hamiltonians H (qj , pj , t) and H'(Qj , Pj , t) were 
conceived as alternative descriptions of the same dynamical system. On the other 
hand, we may as well conceive H and H' as describing different dynamical system. 
A canonical transformation of H into H f then establishes a correlation of both dy¬ 
namical systems. This way, it is sometimes possible to find the solution of a given 
dynamical system by canonically transforming it into a second system that is easier to 
solve. The solution of the original system is then obtained by canonically back trans¬ 
forming the solution of the second system. With examples 19.4 and 21.16, we shall 
work out the solutions of the damped and the time-dependent harmonic oscillators, 
respectively, by canonically transforming these systems into the ordinary harmonic 
oscillator. 
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19.1 Example of a Canonical Transformation 

Let the generating function be given by 

F\ (q j , Qj ) = ^ ' Qk Qk- 

k 

According to (19.10) the particular transformation rules follow as 
dFi dFi 

Pi = ^r 1 = Qi> p i = ~J7F = -<H> H ( p J> Qj ) = H(qj, P j). 

dq t dQi 

The example shows the in the Hamiltonian formalism, the momentum and position 
coordinates play equivalent parts. 

EXAMPLE 


19.2 Point Transformations 

We consider the canonical transformation that is defined by the particular generating 
function 

Flip j > Pj ,t) = ^ ' Pkfkiq j , t), 

k 

with arbitrary differentiable functions fk(qj , t). The transformation rules (19.12) for 
this F 2 follow as 

v—> dfk 

Qi = fMjJ ), Pi = Fj P ^F' 

k (>q ' 

H\Qj ,Pj,t) = H{q j ,pj,t) + Y J Pk d F. 

k 01 

The new position coordinates Qi thus emerge as functions of the original position co¬ 
ordinates qi , without any dependence on the momentum coordinates. Transformations 
of this type are referred to as point transformations. This class of transformations is 
generally canonical as we can always construct the corresponding generating function. 
The particular case fk(qj , t) = qk then defines the identical transformation 

Qi =qi , Pi = ^2 p k&ki = Pi, H\Qj, Pj,t) = H(qj, pj,t). 
k 

EXAMPLE 


19.3 Harmonic Oscillator 

The kinetic energy T(p) and the potential energy V(q) of a particle be given by 

p I 9 I 999 & 

T(p) = —, V(q) — -kq = -mco q , co= —, m, k, co = const., 

2 m 2 2 m 
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with m denoting the particle’s mass, k a characteristic constant of the oscillator, and 
co its characteristic frequency. The Hamiltonian of this system is then 


P 2 1 99 

H(q, p) = — + -mco 2 q 2 . 
2m 2 


(19.16) 


The canonical equations and the equation of motion follow as 


on p\L) on 9 9 

q(t)= — = -, —p(t) = —— = q(t)mco 2 , q+co 2 q= 0. (19.17) 

op m oq 

The direct way to evaluate the dynamics of this system is to integrate the equation of 
motion. Here, we choose the “detour” over the canonical transformation formalism, 
namely to map our system into another system with Hamiltonian H f whose canonical 
equations are even easier to solve. As the “target Hamiltonian” H ', we choose 


H\P) = coP . 


(19.18) 


A simple transformation (FI; q, p) ( HP, Q), that provides this mapping is obvi¬ 
ously 


Example 19.3 


q = 



sing, 


p = \l2mcoP cos g, 


H f = H. 


(19.19) 


We observe that the new momentum P has acquired the dimension of an action , 
whereas the new position coordinate g is now dimensionless, i.e. an angle. In order to 
ensure that the form of the canonical equations is maintained in the new coordinates, 
we must test whether this transformation is actually canonical. To this end, we must 
find a generating function that yields the transformation rules (19.19). 

We try a generating function of the form F\(q,Q,t). The transformation 
rules (19.19) are first cast into the particular functional form that corresponds to 
the generating function F\(q, Q,t ), hence into the form p = p(q , g, t) and P = 

P(q,Q,t) 

1 9 1 

p = mcoq cot g, P = -mcoq —^—, H =H. (19.20) 

2 shr g 

We must now find a function F\(q , g, t) that yields these particular transformation 
rules according to the general prescriptions (19.10). Thus, F\(q, g, t) must satisfy 


-= mcoq cot g, 

dq 


dFi 

92 


-mcoq 



Obviously, such a function exists and is given by 



I 9 

Fi(q, Q) = -mcoq cotg. 

The transformation (19.19) thus establishes indeed a canonical transformation. This 
may be observed also from the fact that the rules (19.20) satisfy the symmetry condi¬ 
tions (19.15) 


3 P mcoq 
3 q sin 2 g 


3 p 
dQ 
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With the evidence of the transformation (19.19) being canonical, it is ensured that the 
transformed system (19.18) constitutes on its part a Hamiltonian system — and hence 
the maintains the canonical form of the canonical equations. Explicitly, the canonical 
equations of the transformed system are 


dH' 

Q(t) = -= co, 

dP 


dH' 

oQ 


: 0 . 


These equations are thus equivalent to the original canonical equations (19.17) that 
emerged from the original Hamiltonian (19.16). As H' does not depend on Q, we 
observe that the new canonical position coordinate Q is cyclic, hence that its conjugate 
canonical momentum P represents a conserved quantity. The canonical equations for 
Q(t ) and P(t) can be immediately integrated, yielding 


Q(t) = cot + Q( 0), P(t) = P( 0). 


The system’s dynamics are thus completely solved in the simplest possible manner. 
Inserting the solution functions Q(t) and P{t) into the transformation rules (19.19), 
we obtain the solutions in the original coordinates q(t) and pit) 

q(t) = J —— sinlcot + g(0)), p{t) = JlmcoPiO) cos (cot + 2(0)). 

V mco v 7 v 7 

The trigonometric functions can finally be split by means of the addition theorems. 
According to (19.19), the values sin 2(0) and cos 2(0) can then be expressed in terms 
of the initial conditions g(0) and p(0 ) of the original system 

pi 0) 

q{t) =q (0) cos cot H-sin cot, p(t) = —q(0) mco sin cot + p (0) cos cot. 

mco 

As expected, we find the solution of the harmonic oscillator exactly in the form as we 
would have obtained by a direct integration of the canonical equations (19.17). 

At this point, one could well argue that overall effort needed to solve the canonical 
equations (19.17) along the “detour” over the canonical transformation method is even 
larger than that for the direct solution. But this is only due to the simplicity of the orig¬ 
inal system. The example here was just chosen to demonstrate the method consisting 
of three steps: (i) forth transformation of the initial conditions into a second system, 
(ii) solving on that basis the dynamics of the second system, and (iii) transforming 
back the obtained solution into the original system coordinates. We may depict both 
alternatives by means of the following diagram: 


Solution of the canonical equations 
of Hamiltonian H 

(H;q(0), p(0)) -- (q(t), p(t)) 


Canonical forth 
transformation 
of Hamiltonian 
and initial 
conditions 


M(0) M~\t) 

Solution of the canonical equations 
of Hamiltonian H f 


Canonical back 
transformation 
of the solution 


(#'; 2 ( 0 ), pm 


(! Q(t),P(t )) 
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In the next Example 19.4, we will show that the method to determine the dynamics 
of a given system by transforming it into a second system that is easier to solve can 
indeed reduce the overall effort, as compared to a direct solution of the original system. 
This will become obvious with Example 21.16, where we treat the time-dependent 
damped harmonic oscillator. This case has long been thought of as possessing no 
analytic solution. Yet, the solution of this problem by means of a generalized canonical 
transformation is fairly straightforward. The price to pay is that we must find the 
appropriate generating function. 


EXAMPLE 


19.4 Damped Harmonic Oscillator 


The Hamiltonian of the damped harmonic oscillator is explicitly time dependent 
v 2 1 

H(q, p, t) = — e~ 2yt H— mco 2 e 2yt q 2 , (19.21) 

2m 2 

with the abbreviations 2 y = fi/m and to 2 = k/m. As before, m stands for the mass 
of the moving point particle, for the friction coefficient, and k for the oscillator’s 
constant. The canonical equations follow as 


q(t) = 


dH 

dp 


m e -2y t 


m 


~P( t) = ^ =q(t)mco 2 e 2yt . 
aq 


In the left-hand side equation, we see that the canonical momentum p(t) no longer 
coincides with the kinetic momentum p^mit) = mq(t ), provided that y ^ 0, 


pit) = mq(t)e 2yt - Pkin(t)e 2yt . 


We may combine the two first-order equations into one second-order equation for q(t) 
to obtain the equation of motion of the damped harmonic oscillator in its common 
form 

q + 2yq + co 2 q = 0. (19.22) 


Instead of solving this equation directly by means of an appropriate Ansatz function, 
we will first map the Hamiltonian (19.21) by means of a canonical transformation into 
the Hamiltonian of an undamped harmonic oscillators. In present case, the canonical 
transformation will be based on a generating function of type F 2 , namely 

F 2 (q , P, t) = e yt qP — -my e 2yt q 2 . 

According to (19.12), the subsequent transformation rules follow as 


Example 19.3 




374 


19 Canonical Transformations 


Example 19.4 


As the new position coordinate Q solely depends on the old position coordinate q , we 
are dealing here with a particular case of the general class of point transformations. 
Furthermore, the relation between old and new coordinates is obviously linear. We 
may thus express the transformation rules in matrix form. Solving for the old coordi¬ 
nates this yields 


(q\ = ( e-y' 0 \(Q\ 
\pj \—mye yt e yt ) \P) ' 


(19.23) 


According to the rule (19.12) for the mapping of the Hamiltonians, we get the 
new Hamiltonian H'(Q,P,t ) by expressing the original Hamiltonian H(q,p,t) 
via (19.23) in terms of the new coordinates Q, P and, moreover, by adding 3 F 2 /dt 

H' = -m _1 e~ 2yt (— mye yt Q + e yt P) 2 + -mco 2 e 2yt e~ 2yt Q 2 + yQP — my 2 Q 2 

= ^ m~ l (P my Q) 2 + ^ mco 2 Q 2 + yQP my 2 Q 2 . 

In the present example, we thus find a transformed Hamiltonian H r that no longer 
depends on time explicitly 

H\Q, P)= ^- + \mw 2 Q 2 , M 2 = co 2 -y 2 . 

2m 2 

We now observe that H r emerges as exactly the Hamiltonian of an undamped har¬ 
monic oscillator with angular frequency do = yjco 2 — y 2 . Its solution is already known 
from Example 19.3 


/2(0\ _ / cos dot m 1 do 1 sin5A /2(0)\ 
\P(t)J \ —mdo sin dot cos dot J\P(0)J 


The solution functions q(t) and p(t) of the damped harmonic oscillator now follows 
as the product the solution (19.24) and the canonical forth and back transformations, 
given by (19.23) and its inverse 

/#(0\_/ e ~ yt cos dot m l db~ l sincoA / 1 0\ /g(0)\ 

\p(t)J \—mye yt e yt J \—m do sin dot cos dot J \my IJ \p(0)J ’ 


On the right-hand side, the initial conditions 2(0), P( 0) of the transformed system 
were expressed through those of the original system, ^(0), p( 0). Explicitly, according 
to the inverse transformation of (19.23) at t = 0, we have 2(0) = g(0) and P( 0) = 
myq(0) + p(0). The determinants of all matrices are unity and hence the determinant 
of the combined linear mapping ( q(0),p(0 )) ( q(t),p(t )). This is in agreement 
with the requirement of Liouville’s theorem. 

In the form of the product of three matrices, it becomes obvious that the solution 
method via canonical transformation consists of the three steps, as sketched at the 
end of Example 19.3. We may finally express the solution of the damped harmonic 
oscillator (19.22) concisely by multiplying the matrices 



= R(f) 


(q(0)\ 

\P(0)J ’ 
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with 


Example 19.4 


m 


yt [cos cot + yco 1 sin cot] 
—e yt m co 2 co~ x sin cot 


e yt m x do 1 sin cot 
e yt [cos cot — yco~ l sin cot] 


-2 2 2 
co —co — y . 


The present example shows that the task of solving the equation of motion of a given 
dynamical system can be facilitated if we succeed to represent it as the transformed 
solution of a another system that is easier to solve. But this works only if we can find 
an appropriate generating function. 


EXAMPLE 


19.5 Infinitesimal Time Step 


We consider the particular canonical transformation that is generated by the function 


Fi(qj, Pj,t) = y ]Pi + H(qj,pj,t)St. 

i 


(19.25) 


Herein, H stand for the Hamiltonian of the given dynamical system, and 8t for an 
infinitesimal interval on the time axis. From the general form of transformation rules 
for generating functions of type F 2 we obtain the particular rules for (19.25) as 


dF 2 dH dpt 

Pi = — £ = Pi +- St = Pi - —St, 

<kjj dcji dt 

dF 2 dH 1st ordering dH dqt 

Qi = ^W= <U + WW 81 = c h + T—' St = C P + “TT'V 

dPi dPj dpi dt 

, 3F 2 dH dH 

H = H + —- = H + - St = H + - St. 

dt dt dt 


In last rightmost terms of these equations, the canonical equations were inserted, re¬ 
spectively. Solving for the transformed quantities, this means 


Pi = Pi + Pi 
Qi = Qi F q t 8t , 
H' = H + H8t. 


We now observe that the particular generating function (19.25) defines precisely the 
canonical transformation that pushes the system ahead by an infinitesimal time step 8t. 
As any canonical transformation can be applied an arbitrary number of times in se¬ 
quence, we can conclude that the transformation along finite time steps is also canon¬ 
ical. This is an important result: the time evolution of a Hamiltonian system consti¬ 
tutes a particular canonical transformation. As already stated, the class of canonical 
transformations are characterized by their property to map Hamiltonian systems into 
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Hamiltonian systems. It is thus ensured that a Hamiltonian system remains a Hamil¬ 
tonian system in the course of its time evolution. 


EXAMPLE 


19.6 General Form of Liouville’s Theorem 


With the theory of canonical transformations at hand, we may cast Liouville’s theorem 
into the following general form: the volume element dV = dq \... dq n dpi ... dp n of 
a Hamiltonian system with n degrees of freedom is invariant with respect to canonical 
transformations, 

dQi.. .dQ n dP \... dP n ™ n '= nsf ' dqi . ..dq n dp \.. .dp n . 


For general transformations of the system’s coordinates, the transformation of the 
volume element dV is determined by the determinant D of its Jacobi matrix 

dQ i.. .dQ n dP \.. .dP n = D dq \.. .dq n dp\ ... dp n , 

_ d(Q\,..., Q n , P\,..., P n ) 

d(qi,...,q n ,pi,...,p n ) 


Liouville’s theorem thus states that the determinant D of the transformation’s Jacobi 
matrix is unity in case that the transformation is canonical. 

For the sake of transparency, we first prove Liouville’s theorem for the case of a 
system with one degree of freedom, i.e., for ft = 1. For such a system, the determinant 
D of the Jacobi matrix emerging from a transition from old coordinates q, p to new 
coordinates Q = Q(q, p), P = P(q, p) is given by 

D _ d(Q,P ) _ dQdP dQ dP 
3 (q,p) 3 q dp dp dq 

According to the general rule for the partial derivatives of the inverse functions q = 
q(Q, P ), p = p(Q, P) we have 

dp _ 1 dQ 
d~P ~ ~D^q' 

which means that the determinant D of the Jacobi matrix can be expressed as 


D = 


d_Q 

dq 


dp_ 

dp 


n-i 


(19.26) 


In the particular case of a canonical transformation, the transformation rules can be 
derived from a generating function F^q, P, t), as stated in (19.12), 


3 F 2 

Q = —-, 

3 P 


P = 


d_F2 

dq 


Inserting the expressions for Q and p, the determinant D is equivalently expressed as 


D = 


d 2 F 2 f d 2 F 2 I -1 


= 1 . 


dqdP \_dPdq _ 

But this equals unity as the partial derivatives may be interchanged. 
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The proof for the general case of systems with n degrees of freedom is worked out 
analogously. In the case of a Hamiltonian system, the determinant D of the Jacobi 
matrix that is associated with a general transformation of the system’s coordinates has 
an even number of rows (columns). We assume the transformation to be invertible. 
Then, we may express the new position coordinates Qi = Qi(qj,Pj) as functions 
of the old coordinates, and the old momenta as functions of the new coordinates, 
Pi = pi(Qj , Pj ). The determinant of the associated Jacobi matrix is the represented 
by 

D _ Qn) d(pi,---,Pn) 

d(qi,...,q n ) _d(P\,...,P n ) 

which generalizes the relation (19.26). Provided that a generating function 
F 2 (qj , Pj, t) exists, then the transformation is referred to as canonical, and the trans¬ 
formation rules are given by (19.12). Inserting Q\ and pt yields 


(19.27) 


D = 


/ 3 2 ^2 \ 

f d 2 F 2 \ 

\dqjdPi) 

\dPjdqi) 


= 1 , 


which is again unity as we may interchange the sequence of partial derivatives and 
due to the fact that determinants of transposed matrices coincide. 

We finally remark that the generating function F 2 used here in this proof is com¬ 
pletely equivalent to the other types of generating function. For, the determinant 
(19.27) of the transformation’s Jacobi matrix has the equivalent representations 


HPu •••, Pn) d(pi,...,p n ) 
d(qi, ...,q n ) _d(Qi, ..., Q n ) 


d(P\,...,P n ) " d(qi,...,q n ) 
d(pi,...,p n ) \_d(Qi, ..., Qn)_ 


d(Ql,---,Qn ) 

d(pi,...,p n ) ld(P\,...,P n ) 


The result D = 1 for a canonical transformation then follows in the same way as above 
by inserting the rules into the appropriate generating function F\, F 3 , or F 4 . 

With the result of Example 19.5, we know that the time evolution of a Hamiltonian 
system can be conceived as a particular canonical transformation whose generating 
function is based on the Hamiltonian H . This yields the more special version of Liou- 
ville’s theorem from Chap. 18, where it was stated that the volume element dV of a 
Hamiltonian system is invariant in the course of the system’s time evolution. 


Example 19.6 


EXAMPLE 

19.7 Canonical Invariance of the Poisson Brackets 

For a Hamiltonian system H(qj , pj , t) of n degrees of freedom, and for two differen¬ 
tiable functions F(qj, pj, t ), G(qj, pj, t) of the canonical variables and time t, the 
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Poisson bracket 1 of F and G is defined by 


^\d qi d Pi dpidqij 


i =1 


(19.28) 


A special case is established if we set up the Poisson brackets of the canonical vari¬ 
ables qi and pi . As these variables are required to not depend on each other, we im¬ 
mediately get 


[qi,qj]= 0, [ p,, Pj \ = 0, \qi, pj\ = <5, 


(19.29) 


We hrst convince ourselves that the same relations hold for canonically transformed 
coordinates Qi and Pi, hence that the fundamental Poisson brackets (19.29) are in¬ 
variant under canonical transformations. Making use of the relations (19.15), we find 


k= 1 

n 

= E 


k =1 


[Qi 


k =i 

n 


2 ' 

k 


'd Qj 

d Qj 

3 Qi 3 Qj 


_ dqk 

dpk 

dpk dqk 

~ 

~dpk 

3 Qj 

dqk dQj~ 

d Q i 

= =0 

_dPi 

3 Pk 

3 Pi dqk _ 

3 Pi 

r dp 

d_Pl _ 

3 Pi 3 Pj' 


\_dqk 

3 Pk 

dpk dqk _ 


f dpk dPj 

dqk d P J 


\_ 9 Qi dp k 

d Qi dqk 

;J 3 Qi 

\d_Qi_ 


d_Qi dpp 


1 dqk 

3 Pk 

3 Pk d q k _ 


I" 3 Pk 

[dPi 

3 P, 
dpk + 

dqk 3 Pj' 

3 Pi dq k _ 

3 Pj 

= —— = Sn. 
dPi 1 


(19.30) 


We are now prepared to show that the Poisson bracket of two arbitrary functions 
F(qj , pj, t) and G(qj, Pj,t) establishes likewise a canonical invariant. The time t 


1 Simeon Denis Poisson , French mathematician and physicist, b. June 21, 1781, Pithiviers, France- 
d. April 25, 1840, Paris, France. Descending from a simple social background—his father was a 
soldier—Poisson had good teachers who recognized his extraordinary gifts and made it possible for 
him to begin studies at the Ecole Polytechnique in Paris in 1798. There, his mathematical talents 
were recognized by Laplace and Lagrange. Poisson became an assistant professor, and, in 1806, a 
full professor at the Ecole Polytechnique, where he energetically worked to improve teaching and the 
formation of students. 

His research initially was focused on the theory of ordinary and partial differential equations, 
which he applied to many different physical problems. Thus, Poisson developed further the mechanics 
of Laplace and Lagrange, and studied problems related to the propagation of sound, elasticity, and 
static electricity. He later turned his interests towards the theory of probabilities, and recognized the 
seminal nature of the Law of Large Numbers. 

Many ideas and concepts are named after Poisson, such as the Poisson equation in potential the¬ 
ory, the Poisson bracket of mechanics, the Poisson ratio in elasticity, and the Poisson distribution in 
statistics. 
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as the common independent variable of both the original and the transformed system 
is not transformed. We may thus restrict ourselves to the nested mapping 

F(qj,pj) = F(Q k (qj, pj), P k (qj, pj )), 

G(qj, pj) = G(Q k (qj, Pj), Pk(qj,Pj)). 


Applying the chain rule, one finds 


aa 9f^\/jG82j 9G8P(\ 

’ q,P 4 dq k dPi dq k )\dQj dp k + dPj dp k ) 

K l J 

( dF dQi dF dP t \/dG dQj dG apAl 

~~ (acT ~dF + dPi WkKWj ~du + Wj y 

Multiplying and recollecting the terms for Poisson brackets with respect to the coor¬ 
dinates Qi , Pj yields the equivalent expression 


dF dG dF dG 

[F, G\a v = ? 7 \ - [Qi, Qj] +- [Pi, Pj] 

q 'P dQi dQj J dPidPj J 


dF dG dF dG 

+- [Qi, Pj] - [Qi,Pii 

dQi dPj J dPi dQj J 


(19.31) 


Equation (19.31) holds for any invertible coordinate transformation. In the particular 
case that the transformation is canonical , then in addition the relations (19.30) for the 
fundamental Poisson brackets apply. In that case, (19.31) simplifies to 


[F, G] q , p = 



dF dG 
~d~Qi ~dPj 


dF dG 

dPidQ'j 


8ij = [F, G]g,p. 


The Poisson bracket [F, G] is thus uniquely determined by functions F and G and 
independent from the underlying coordinate system, provided that a transformation of 
the coordinate system is canonical. 


Example 19.7 


EXAMPLE 


19.8 Poisson’s Theorem 


Poisson’s theorem embodies an important benefit of the Poisson bracket formalism: if 
two invariants I\ and h of a given dynamical system are known, then it is possible to 
directly construct a third invariant I 3 . In order to demonstrate this, we first derive the 
general rule for the total time derivative of a Poisson bracket 


d 

dt 


[F, G] = 



+ 


F, 


dG~ 

dt 


The proof is easily worked out by directly calculating the total time derivative of the 
Poisson bracket’s definition from (19.28) 
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dG d ( dF\~ 
dqi dt \dpi)_ 



3G 3 / 3F dqj 3F dpj 3F\ 

dqi dpt \dqj dt dpj dt dt ) 

dG d / dF dqj dF dpj dF\ 

dpi dqi \dqj dt dpj dt dt ) 

dF d / dG dqj dGdpj dG\ 
dpi dqt \dqj dt dpj dt dt ) 


= E 


'dF 9 , 

( dG\ 

dG a 

fdF\ 

, dG 
^ + dpi 

— i 

f—) 

. 3 qi dpi 1 

Kdi) 

dqi dpi 

\dt j 

dqi 

V dt ) 


dF d 
dpi dq x 
dG 


F, 


dt 


+ 




If both 1 1 = F as well as I 2 = G are invariants of motion, i.e., if dF/dt =0 and 
dG/dt = 0, we conclude 

d 

— [F,G] = 0. (19.32) 

dt 

With 73 = [F, G] we have then found another, possibly trivial, invariant of the system. 

We remark that Poisson’s theorem in the form of (19.32) only applies for invariants 
F and G, whose total time derivatives vanish identically. 

In case that dF/dt = 0 and dG/dt = 0 represent only implicit functions, we cannot 
infer that the Poisson brackets [dF/dt,G] and [ F , dG/dt ] vanish. The reason is that 
the construction of a Poisson bracket does not constitute an algebraic but an analytic 
operation. In the latter case, we must impose the stronger condition that the partial 
derivatives of dF/dt and dG/dt with respect to the qi and the pi all vanish 


dF 
dqi \ dt 



3 / 

( dF\ 

r dF 1 

0 , 

— 

—1=0 =► 

—, G 


3 Pi 

{dt ) 

dt 


= 0 . 


EXAMPLE 


19.9 Invariants of the Plane Kepler System 

The Hamiltonian for a plane and time-independent Kepler system is given by 
H (qj’Pi) = \{p\ + Pl) ~ f r = fqf+f, n = G(mi+m 2 ). 

















19 Canonical Transformations 


381 


Herein, G denotes the gravitational constant, and mi, m 2 the masses of the respective 
bodies. The canonical equations are obtained as 


dH 

Vi = - 7 — =Pi , 
dPi 




The angular momentum D = qip2 — qiPl constitutes an invariant of all systems with 
central force fields. We verify this by directly calculating the time derivative of D and 
subsequently inserting the canonical equations 


dD 

dt 


= q\P 2 + q\P 2 -qip\ -<np\ 


11 11 

= + P\P 2 + -$q 2 q\ ~ P 2 P 1 


= 0. 


Another invariant of this system is given by 

2 q 1 

Rl =q\p 2 ~ q2P\P2 ~ P' — • 
r 

We convince ourselves of this fact again by direct calculation of the time derivative 
of Ri 

dR\ . 2 . 0 • <71 

—— =qip 2 + 2qip2P2-qipipi-qipipi-q2P\P2 - p— 

dt r 

q 1 

+ li-r(qiqi -\-q2q2) 
r 5 

= P1P2 - ^qmpi - P1P2 + ^qunpi + 

- ^p\{q\+ql) + ^qiiqipi + <i2P2 ) 

= 0 . 


According to Poisson’s theorem, the function R 2 = [D, R\] then represents another 
invariant of the system 


Def dD dR\ dD dR\ dD dR\ 

R 2 = [D,R { ]= ---- +-- 

dqi dpi dpi dqi dq 2 dp 2 




q\ 


~q2P2 ~b q2 1 P 2 - 


dD dRi 
dp 2 dq 2 


- pi(2qip 2 - q 2 Pl) + qi 


(piP2 - ^qiq^j 


2 q 2 

= q 2 Pi -q\P\P2- li — • 
r 
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We can prove this easily by directly calculating dR^/dt. The invariants R\ and R 2 
constitute the components of the Runge-Lenz 2 vector. We will get back to the Runge- 
Lenz vector in Example 21.21. 


2 Carl David Tolme Runge, German mathematician and physicist, b. August 30, 1856, Bremen, 
Germany-d. January 3, 1927, Gottingen, Germany. Runge came from a family of merchants and 
grew up in Havana and Bremen. He took up studies of literature at Munich, but soon switched to 
mathematics and physics. As a student in Munich, he met Max Planck, which was the beginning of 
a lifelong friendship. Runge finished his studies with a thesis on differential geometry, supervised 
by Weierstrass, and became a professor of mathematics in Hanover in 1886. In 1906, he took up a 
professorship in Gottingen. Runge worked on the numerical solution of equations—the Runge-Kutta 
method for the solution of differential equations is named after him—and on spectroscopy. He did 
spectroscopical measurements himself and contributed eminently to the understanding of the spectral 
series of various atoms. Runge applied his results to the new field of the analysis of stellar spectra. 
In a textbook on vector analysis, Runge described the derivation, originally found by Gibbs, of con¬ 
served quantity of the Kepler problem. This discussion was then referred to by Wilhelm Lenz in his 
early quantum mechanical treatment of the hydrogen atom. The corresponding conserved quantity 
has become known as the Runge-Lenz vector. 

Wilhelm Lenz , German physicist, b. February 8, 1888, Frankfurt am Main, Germany-d. April 
30, 1957, Hamburg, Germany. Lenz attended the same school in Frankfurt as Otto Hahn, and took 
up studies of mathematics and physics in Gottingen in 1906. He obtained his Ph.D. in 1911 with 
Arnold Sommerfeld in Munich and became Sommerfelds assistant. In 1921, Lenz became professor 
of theoretical physics in Hamburg. Among his students and assistants in Hamburg were Pascual 
Jordan, Wolfgang Pauli, and Hans Jensen, who was awarded the Nobel Prize in physics in 1963 for 
the development of the shell model of the atomic nucleus. Lenz’ contributions to the early quantum 
mechanics of hydrogen-like atoms renewed interest in the Runge-Lenz vector, which, actually, had 
been known long before. A simple model for the description of ferromagnets developed by Lenz and 
proposed as a thesis topic to one of his students is well known today by the name of the student: the 
Ising model. 






In the preceding chapter, we tried to perform a transformation to coordinate pairs 
(q t , pi = Pi ) for which the canonical momenta were constant. We now proceed one 
step further and look for a canonical transformation to coordinates Pi = pm and 
Qi = q i0 which all are constant and are given by the initial conditions. When we have 
found such coordinates, the transformation equations are the solutions of the system 
in the normal position coordinates: 


qi = qi (<?(■<). pw, t ), pi = pi (qt o, Pi o, t). 


The coordinates ( Pi , Qi) obey the Hamilton equations with the Hamiltonian 
H'(Qi, Pi , t). Since the time derivatives vanish by definition, we have 


Pi= 0 = 


dH' 

Wi’ 


Qi 



( 20 . 1 ) 


These conditions would certainly be fulfilled by the function H' = 0. In order to 
perform the coordinate transformation, we need a generating function. For histori¬ 
cal reasons—Jacobi made this choice—we adopt among the four possible types the 
type F 2 — S(qi, Pi,t ), which already has been treated in the preceding chapter. It is 
generally known as the Hamilton action function. For this choice the equations (19.12) 
hold. We now require that the new Hamiltonian shall identically vanish. Then 


ds ( ds ds , 

— + H[q u ...,q n ; pi = p n = —;t =0. 

at \ oq i oq n 


Writing down this equation with the arguments, we obtain 


dS(qi,Pi=Pi,t ) 
dt 


. ds ds . 

+ H ( q \,..., q n ; -—,..., -—; t ) — 0 . 
dqi dq n 


( 20 . 2 ) 


(20.3) 


This is the Hamilton-Jacobi differential equation} The Pi denote constants that, as 
noted above, are fixed by the initial conditions pm. By means of this differential equa¬ 
tion we can determine S. We note that this differential equation is a nonlinear partial 


1 Carl Gustav Jacob Jacobi , b. Dec. 18, 1804, Potsdam, son of a banker-d. Feb. 18, 1851, Berlin. 
After his studies (1824), Jacobi became a lecturer in Berlin and in 1827 to 1842 held a chair as a pro¬ 
fessor in Konigsberg (now: Kaliningrad). After an extended travel through Italy to restore his weak 
health, Jacobi lived in Berlin. Jacobi became known for his work Fundamenta Nova Theoria Functio- 
rum Ellipticarum (1829). In 1832, Jacobi discovered that hyperelliptic functions can be inverted by 
functions of several variables. Jacobi also made fundamental contributions to algebra, to elimination 
theory, and to the theory of partial differential equations, e.g., in his Lectures on Dynamics (1842 to 
1843), published in 1866. 


W. Greiner, Classical Mechanics , 

DOI 10.1007/978-3-642-03434-3_20, © Springer-Verlag Berlin Heidelberg 2010 
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differential equation of first order with n + 1 variables qi, t . It is nonlinear, since H 
depends quadratically on the momenta that enter as derivatives of the action func¬ 
tion with respect to the position coordinates. There appear only first derivatives with 
respect to the qi and the time. 

To get the action function S , we have to integrate the differential equation n + 1 
times (each derivative dS/dqt, dS/dt requires one integration), and we thus obtain 
n + 1 integration constants. But since S appears in the differential equation only as a 
derivative, S is determined only up to a constant a ; i.e., S = S' + a. This means that 
one of the n + 1 integration constants must be a constant additive to S. It is, however, 
not essential for the transformation. We thus obtain as a solution function 


S = S(q\,... ,q n \ p\..., p n \t). 


where the Pi are integration constants. A comparison with (19.12) leads to the require¬ 
ments 


Pi 


dS 

Qi = — 

BP, 


BS(qq n \ fi\, , fin, 0 

9ft 


(20.4) 


The Pi, oti can be determined from the initial conditions. 

The original coordinates result from the transformation equations (19.12) as fol¬ 
lows: From 


_ dSffji Pj11) 

ai ~ Wi 

follow the position coordinates 


qi = q t (aj, Pj, t). 


Insertion into 


Pi = 


dSjqj, Pj,t ) 
dqi 


= pi(q i9 pi, t) 


finally yields 


Pi = Pi {oti, Pi, t). 


Now the qi(aj, pj, t) and Pi{otj, pj, t) are known as functions of the time and 
of the integration constants aj, Pj . This simply means the complete solution of the 
many-body problem characterized by the Hamiltonian H(qi, pi,t). 

We can separate off the time dependence in S. If H is not an explicit function of 
the time, H represents the total energy of the system: 



E. 


From this, it follows that S can be represented as 


(20.5) 


S(qi, Pi,t) = S 0 (qi, Pi) - Et. 


To explain the meaning of S, we form the total derivative of S with respect to time: 
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But, since P t = 0, we have 

dS(q,. Pi = Pi, t) _ 
dt 



Because 




it further follows that 


dS(qi , Pf (Pa , C[a), t) 
dt 


D Piqi - H(cji, pi.t) = L(qt, pi, t). 


( 20 . 6 ) 


H and L are not bound by restrictions; in particular they can be time-dependent. This 
means that S is given by the time integral over the Lagrangian: 



(20.7) 


Since this integral physically represents an action (energy • time), the term action 
function for S is obvious. The action function differs from the time integral over the 
Lagrangian by at most an additive constant. However, this last relation cannot be used 
for a practical calculation, since as long as the problem is not yet solved, one does not 
know L as a function of time. Moreover, L(qt , pi , t) in (20.6) depends on the original 
coordinates qi , pi , while the ^-function is needed in the coordinates qi , Pi ( q a , p a ). 

Equation (20.7) is not unknown to us: The action function S turned up before when 
formulating the Hamilton principle (18.25). Before further continuing this discussion, 
we will illustrate the Hamilton-Jacobi method by an example. 

EXAMPLE 

20.1 The Hamilton-Jacobi Differential Equation 

We start again with the harmonic oscillator. The Hamiltonian is 


The Hamilton action function then has the form (compare (19.12) and (20.3)) 

dS 

S = S(q,P,t) and p = —. 

dq 

From this, we obtain the Hamilton-Jacobi differential equation: 



For solving the problem, we make a separation ansatz into a space and a time 
variable. A product ansatz would not work here, since the differential equation is not 
linear. We therefore set a sum: 


S = S l (t)^S 2 (q). 
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For the partial derivatives, we then get 


dS _ dS 2 (q) dS _ dSi(t) 
d q dq ’ dt dt 

This leads to 


-Si(t) = 


2m 


dSjjq) 

dq 



where ft is the separation constant. (The left-hand side depends only on the time t , 
the right-hand side only on the coordinate q : Therefore, both sides can only be equal 
if they are equal to a common constant ft.) For the time-dependent function, we then 
have 




which leads to 




For the space-dependent part, there remains the following equation: 


2m \ dq J 


dq 


As sum of the two parts, we then obtain for S 


S(q, /3,t) = Vmk J y — q 2 dq — fit 


For the constant Q = a, we then have 

dS Vrnk r/2p 


a = Q = — = 

d/3 k 


m 


- 1/2 


dq — t. 


The integral can easily be evaluated, and we obtain 


Q + t = A / — arcsm 

K 


in(V-fc/(2/3)^). 


With the usual abbreviation co 1 = k/m, we obtain the equation 


' ^sin co(t + Q). 
k 


A comparison with the known equation of motion of the harmonic oscillator shows 
that corresponds to the total energy E , and Q to an initial time to. Energy and time 
are therefore canonically conjugate variables. Both the energy and the time to (which 
corresponds to an initial phase) are given by the initial conditions. 
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The separation of the Hamilton-Jacobi equation represents a general (often the only 
feasible) way of solving it. If the Hamiltonian does not explicitly depend on the time, 
then 


dS 

n + H (*•■ 


• ? qn 5 


as 

dqi' 


dS 


= 0 , 


( 20 . 8 ) 


and the time can be separated off immediately. We set for S a solution of the form 


S = So(q i9 Pi)-Pt. 


The constant P then equals H and normally represents the energy. After this separa¬ 
tion, there remains the equation 


, ds 0 as 0 \ 

H[q\,... ,q n ; ——,..., —— \= E. 
oq i 3 q n . 


To achieve a separation of the position variables, we make the ansatz 


(20.9) 


So(qi,...,q n ; P\,...,P n ) = y ^Sjjqi, P t ) = S\(q\, Pi)H- S n (q n , P n ). (20.10) 

i 


This means that the Hamilton action function splits into a sum of partial functions Si , 
each depending only on one pair of variables. The Hamiltonian then becomes 


, d S i d Syi , 

dq i dq n , 


( 20 . 11 ) 


To ensure that this differential equation also separates into n differential equations for 
the Si ( qi , Pi), H must obey certain conditions. For example, if H has the form 


H(q\,.. .,q„, pi,p n ) = p\) H- \-H n (q n ,p n ), (20.12) 


the separation is certainly possible. A Hamiltonian of this form describes a system of 
independent degrees of freedom; i.e., in (20.12) there are no interaction terms, e.g., of 
the form H(qi , pi , qj , pj), which describe an interaction between the i th and the jth 
degree of freedom. 

With (20.12), (20.11) reads 


Hi 




= E. 


(20.13) 


This equation can be satisfied by setting each term Hi separately equal to a constant 
Pi ; hence, 

.... H n (q n ,^-)=p n , (20.14) 

\ dq 1/ V dq n ) 

where 


P\ + Pi + • • • + Pn — E . 


(20.15) 


Thus, there are n integration constants Pi in total. 
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Since the kinetic energy term of the Hamiltonian involves the momentum pi = 
dSi /dqi quadratically, these differential equations are of first order and second degree. 
As solutions, we then obtain the n action functions 

S i =S i (q i ,p i ), (20.16) 


which, apart from the separation constants fa , depend only on the coordinate qi . Ac¬ 
cording to (19.12), St immediately leads to the conjugate momentum pi = dSi /dqi to 
the coordinate qi. The essential point is (see (20.12)) that the coordinate pair (qi, pi) 
is not coupled to other coordinates (qk, Pk, i / k), so that the motion in these coordi¬ 
nates can be considered fully independent of the other ones. 

We now restrict ourselves to periodic motions and define the phase integral 


Ji = 



(20.17) 


which is to be taken over a full cycle of a rotation or vibration. The phase integral has 
the dimension of an action (or of an angular momentum). It is therefore also referred 
to as an action variable. If we replace the momentum by the action function 

/ dSi 

Ji = ® —- dqi, (20.18) 

/ dqi 

we see from (20.16) that Ji depends only on the constants fa, since qi is only an inte¬ 
gration variable. We therefore can move from the constants fa to the likewise constant 
Ji and use them as new canonical momenta. Hence, one performs the transformation 

A = Ji(fa) -> fa = fa(Ji). 


The total energy E which corresponds to the Hamiltonian can also be recalculated 
by (20.15) to the J^: 

n 

H = E = J2PiOi)- (20-19) 

1 = 1 


The Hamiltonian is therefore only a function of the action variables, which take the 
role of the momenta. All corresponding conjugate coordinates are cyclic. The con¬ 
jugate coordinates belonging to the / z are called angle variables and are denoted by 
{pi. The generating function Sfaqt, fa) turns with fa(Jk) into S(qi, Ji). The / z are the 
new momenta. We therefore can apply (19.12), and for the related new coordinates, 
we have 


<Pj = 


dS(qi, Ji) 


By transforming to the action variables and angle variables, we thus have performed 
a canonical transformation, mediated by the generating function 


Si (qi , fa) > Si (cpi , Ji). ( 20 . 20 ) 

This transformation from one set of constant momenta to another set actually does not 
give new insights. The meaning for periodic processes lies in the angle variable cpi. 
Since we performed only canonical transformations, we have 

dH 

cpi = -= Vi (Ji) = constant. 

3 Ji 


( 20 . 21 ) 
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One can show that v\ is the frequency of the periodic motion in the coordinate i . This 
relation thus offers the advantage that the frequencies, which are often of primary 
interest, can be determined without solving the full problem. We briefly demonstrate 
this point by the following example: 

EXAMPLE 


20.2 Angle Variable 


We again consider the harmonic oscillator. The expression for the total energy 



is transformed so that we get the representation of an ellipse in phase space: 

2 2 

2 mE 2E/k 



Fig. 20.1. Ellipses in phase 
space 


The phase integral is the area enclosed by the ellipse in phase space: 


J = 


pdq = nab. 


The two half-axes of the ellipse are 


a = \flmE and b = 



We therefore obtain 


J = 2nE 



or 


E = 




This leads to the frequency 
_ dH _ 1 
dJ 2n 



EXERCISE 


20.3 Solution of the Kepler Problem by the Hamilton-Jacobi Method 

Problem. Use the Hamilton-Jacobi method for solving the Kepler problem in a cen¬ 
tral force field of the form 
K 

V(r) =-. 

r 
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Solution. We adopt plane polar coordinates (r, 0) as generalized coordinates. The 
Hamiltonian reads 


H = — 
2 m 


P r + 



K 

r 


( 20 . 22 ) 


H is cyclic in 0, and hence, /?© = constant = l. The pi can be expressed by the Hamil¬ 
ton action function S : 

ds ds ds 

Pi = — =>■ P, = — • P@ = T 77 = constant = #>• 

aqi dr 90 

Thus, we obtain the Hamilton-Jacobi differential equation 



For the action function, we adopt a separation ansatz 


S = S 1 (r) + S 2 (0) + S 3 (O, 


(20.24) 


which is inserted into (20.23): 

J_(/ 9Si(r) \ 2 1 / 9S 2 (Q) \ 2 ] _ K = 3S 3 (Q 

2 m | \ dr ) r 2 \ 90 / J r dt 


(20.25) 


Equation (20.25) can be satisfied only if both sides are constant. The constant is the 
total energy of the system, because 



We remember that 


ds 3 

-= constant = = E. 

dt 


Pi=< 


Qt 


ds _ ds 
dPi~Wi 


(20.26) 


where cii , Pi are constants that follow from the initial conditions. 
We insert (20.26) into (20.25), and solve for 95*2/90: 



2 mp 3 + 



(20.27) 


The same argument that led us to (20.26) now yields 


dS2 

90 


ds 2 m t t p 

-= constant = p 2 , 

c/0 


and therefore, 


9Si _ dSi(r) 
dr dr 


12 mp 3 + : 


2m K 


Pi 


(20.28) 


r 


(20.29) 
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The Hamilton action function can now be written down as follows: 


s =/ 


I Imp's H- dr + /?20 — ft? t. 

r r z 


(20.30) 


We now define ft 2 and P? as new momenta P© and P r . The quantities Q[ conjugate 
to the P[ are also constant. 


Qr = 


_ ds _ d r 

~dp?~wJ 1 


1 2mP 3 + 


2m K 


- dr — t = a?, 


(20.31) 




ds 

Wi 


' 2mf J >) + 


2m K p, 


-# dr + 0 =a 2 . 


(20.32) 


If we identify a 2 with ©', which follows from the initial conditions, we obtain 
p 2 dr 


/ 


^yjlmp? + 2 mKjr — P^/r 


= 0 - 0 '. 


Insertion of the constants and substitution u = l/r leads to 
du 


/ 


y/(2mE)/l 2 * ) + (2otA'm// 2 ) - m 2 


= ©-©'. 


(20.33) 


(20.34) 


This integral of the form 
dx 


/ 


V ax 2 + for + c 

can according to integral tables be written as a closed expression, with 


, 2mE 4 m 2 K 2 

A = 4ac - b 2 = -4— - 77 — < 0, 


0 = © ; + arcsin ( 


l 2 l 4 

-2u + (2mK/l 2 ) 


) 


O r = 


= ©' — arcsin 
l 2 


VV(4w// 2 )(2£ + (mA -2 // 2 ))/ 

/ (l 2 u/mK) — 1 \ 

l v / l+(2H 2 /m^ 2 )/ 

1 


^ (1 + © + (2£7 2 /mA' 2 )cos(© - ©' + nr/2)) ’ 


m 


(20.35) 

(20.36) 


This is the solution of the Kepler problem, known from the lectures on classical me¬ 
chanics. 2 The types of trajectories follow from the discussion of conic sections in the 
representation r = p/(l + s cos (p): 


s=l=E=0 
£ < 1 = E < 0 
e > 1 = E > 0 


parabolas; 

ellipses; 

hyperbolas. 


2 See W. Greiner: Classical Mechanics: Point Particles and Relativity, 1st ed., Springer, Berlin 

(2004), Chapter 26. 
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Equation (20.31) could be rewritten further, by pulling the differentiation into the 
integral and transforming the resulting equation in such a way that the position r 
becomes a function of the time. We skip that here. 


EXERCISE 


20.4 Formulation of the Hamilton-Jacobi Differential Equation for Particle Mo¬ 
tion in a Potential with Azimuthal Symmetry 


Problem. Let a particle of mass m move in a force field that in spherical coordi¬ 
nates has the form V = — K cos 0/r 2 . Write down the Hamilton-Jacobi differential 
equation for the particle motion. 

Solution. We first need the Hamiltonian operator as a function of the conjugate mo¬ 
menta in spherical coordinates. For this purpose we first write the kinetic energy T in 
spherical coordinates: 

r = re r + r©e© + r sinQ^e^ (20.37) 

=> T = -mr • r = -m(r 2 + r 2 © 2 + r 2 sin 2 0<p 2 ). (20.38) 

The Lagrangian then reads 

L = T -V = ^m(r 2 + r 2 © 2 + r 2 sin 2 <p 2 ) - V(r, ®,<p). (20.39) 

We now assume that V (r, ©, <p) is velocity-independent (which is indeed fulfilled) 
and form the canonical conjugate momenta: 

3 L . 3 L 2 • 3L 2-2 

p r = — = mr, ps = —- = mr 0, /a = — = mr sin © 99 . (20.40) 

3r 30 dip 

From this, we obtain 

. Pr A P® . Pip 

m mr z mr 2 shr 0 

Hence, H can be given in the desired form 


H — 'y \ pgCjoL L 


A 1 //? 2 p% 

= p r r + p @0 + Py(p - -m —J + 2 2 + 


Pi 


2 2 


pI 


2 V m 2 r 2 m 2 m 2 r 2 sin 2 0 
+ V(r, 0, <p), 


2 m 2mr 2 2mr 2 sin 2 0 
and for the actual potential (see the formulation of the problem), 

~ 2 ~ 2 " z cos 0 


+ V(r,S,<p) 

(20.41) 


H = — ^ + 


pi 


2 m 2m r 2 2m r 2 sin 2 0 


(20.42) 
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The pi as functions of the Hamilton action variables read 


Pr = 


dS 


ds 


p Q = 


Pip = 


ds 


dr 9© ^ dip 

Therefore, the Hamilton-Jacobi equation has the form 


dS 1 J/9S\ 2 1 / dS x2 

dt 2m I V dr ) r 2 V 9© 


+ 


1 


ds N 


r 2 sin 2 © \dip 


(20.43) 


K cos© 

- =—=0. (20.44) 


EXERCISE 


20.5 Solution of the Hamilton-Jacobi Differential Equation of Exercise 20.4 


Problem. 

(a) Find the complete solution of the Hamilton-Jacobi differential equation from the 
preceding Exercise 20.4, and 

(b) sketch how to determine the motion of the particle. 

Solution, (a) The approach is analogous to Exercise 20.3. We adopt the separation 
ansatz for S , 


S = S 1 (r) + S 2 (®) + S 3 (<p)-Et, 
and insert this into (20.44): 


+ ■ 


2 m \ dr J 2 mr 2 \ 9© J 2 mr 2 sin 2 0 \ dip J 


d_S3\ 2 _ 


K cos© 


r I 


2 ^95i(r)y 


2 mEr 


fds 2 (®)\ 

1 _ 1 1 


V 9© ) 

' sin 2 © 1 

l d<P ) 


+ 2/77 K cos©. 


(20.45) 

= E 

(20.46) 

(20.47) 


Equations (20.46) and (20.47) can only be satisfied if both sides are constant: 

2 


2(*MlY_ 2mEr 2 


\ dr 


- 


2 mEr = constant = /fi, 
2 


-4- +2 ^cos©^, 

l 30 j sin 2 © \ dtp J 
To separate © from <p, we multiply (20.49) by sin 2 ©: 

fdS 3 (<p)\ 2 „ „ „ . o . 2r ^ fdS 2 (&)\ 2 . 2 r\ 

I - = 2/77 K cos © sin © — pi sin © — - sin ©. 

V dip J \ a© / 


(20.48) 

(20.49) 


(20.50) 
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Exercise 20.5 


The separation constant is denoted by fa , since 

2 


dS 

— = Pfp and thus 
dip 


I «!<*'> \ .2 „2 

I,” ) =ft=V 


Therefore, 

2/77 K cos © sin 2 © — fa sin 2 © — sin 2 © (—^ = p (p . 

\ 3 © / 

Integration of (20.48), (20.51), and (20.52) yields 

/• 


(20.51) 


(20.52) 


Pi 


Si = I s j2mE + dr + c\, 


52 = J J 2m K cos 0 — pi — 

53 = (pPip+C3. 


sin 2 © 


J© 4“ r*2, 


(20.53) 


The complete solution of the Hamilton-Jacobi differential equation is obtained from 
(20.53) and the ansatz (20.45) for S : 


-f 


/■ 


2 mE + ^- dr + / x \2mK cos © — fa 


p 2 

sin 2 © 


J© 


+ <PP(p ~ Et + C. (20.54) 

(b) The explicit equations for the motion of the particle follow from the requirement 

dS dS 

Qi = - <£> Oil = -, 

dPi d^ 


since Qi , Pi are constants that are denoted by a?;, fa , and thus, 


dS 


dS 


dS 


= <* 1 , 


= Oi 2 , 


= Oi 3. 


dfa d E dp^ 

The OLi follow from the initial conditions; for example, 

m 


dS 

~dE 


-J 


•JlmE + /ii/r 2 


/ 


m 


- dr = J — 


yjlmE + fair 1 y 2 E J ^/ r 2 + fa/ 2 mE 


/ 


dr — t = c( 2 , 


: Jr 


(20.55) 


(20.56) 


/ mr 4 /3j 


2£ + 4£^ +C 


0(2 + t : 


mr z 




-I-1- c 

2£ + 4£ 2 + ’ 


(20.57) 


r (t = 0) = r 0 


&2-C- 


1™i _^i_ 

2£ + 4£ 2 ’ 
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and as the solution for t , 


Exercise 20.5 





2£ + 4£ 2 ' 


(20.58) 


The 35/3)0i and dS/dp^ are treated likewise. The evaluation of the elliptic integrals 
requires numerical methods. 

We come back once again to our discussion in the context of (20.8) and (20.9). If 
the Hamiltonian does not explicitly depend on the time, as is the case for conservative 
scleronomic systems, the Hamilton-Jacobi differential equation can be brought to a 
simpler form, since S can only linearly depend on t. We therefore transform to 

S = Sq — Et, 

where 5o = So(q — 1,..., q n , Pi, • • •, Pn)- One then obtains the so-called reduced 
Hamilton-Jacobi equation: 



The solution of this differential equation yields arbitrary constants, one of them, e.g., 
Pi, is additive (5o + c solves the above Hamilton-Jacobi equation too) and can be 
omitted. But the reduced Hamilton-Jacobi equation now involves the total energy, so 
that 5o will also depend on E , and therefore, in 

So = So(qi, ...,q n ,E,P 2 , • • . , Pn) 

Pi is replaced by E. We can express this in the following way: Just as in the original 
Hamilton-Jacobi equation, the reduced form also has n integration constants, one of 
them the total energy E. 


EXERCISE 


20.6 Formulation of the Hamilton-Jacobi Differential Equation for the Slant 
Throw 

Problem. Use the reduced Hamilton-Jacobi differential equation to formulate the 
equation of motion for the slant throw. 

Solution. Let the coordinates of the throw plane be x (abscissa) and y (ordinate), 
which will also be used as generalized coordinates. 


H = T + V = ^(x 2 + y 2 )+mgy , 


(20.59) 


dH 

Px = ~x~r =mx. 


dH 

Py = ^=my- 
dy 


(20.60) 


The conjugate momentum p x = mx to the cyclic coordinate x {dH/dx = 0) is a con¬ 
served quantity. We recalculate (20.59) as 



(20.61) 
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Exercise 20.6 


Since H does not depend explicitly on the time and the system is conservative, the 
reduced Hamilton-Jacobi differential equation can be applied. 


1 

2 m 


dSo 

dx 


r+m 


+ mgy = E. 


(20.62) 


By inserting the separation ansatz So = S\(x) + S 2 (y) into (20.62), we obtain 

2 /oc \ 2" 


1 

2 m 


m 


+ mgy = E 


or 


( a ^\ 2 = 2 mE - 2 ^ g y -( a - m '\ 


V 3 x 1 


V 3 y ) 


(20.63) 


(20.64) 


This is satisfied only if both sides of the equation are constant, since v and y are 
independent coordinates. 


3 Si (v) 
dx 


k. 


ra : 


(2m E — fo) — 2 m 2 gy. 


Integration yields the solutions 
S| (x) = Jfa + d, 


Si(y) = ~ 


[(2mE - fo) ~ 2 m 2 gy] 3 ^ 2 + ci. 


3m 2 g 


(20.65) 

( 20 . 66 ) 

(20.67) 


The complete solution of the reduced Hamilton-Jacobi differential equation is of the 
form 


So(x,y,E,p 2 ) = y/]hx 


1 


3 m 2 g 


\(2mE — P 2 ) ~ 2 m z gyy /z + c, 


( 20 . 68 ) 


9 S 0 


= t + a 1 , 


9 S 0 


= 


3 E ' ' 3/3 2 

where the first relation holds because 
_ 3Si _ 3S 0 

011 ~ Je ~ Je ~ 

From this, we obtain y = y(t) as 


- [(2m E — fc) — 2 m z gy] L/z = t + a\ 

mg 


2_, n l/2 . 

2„ _ 2 n 2 u , _ x2 


(20.69) 


2mE — P 2 ~ 2m gy = m g (t + c*q) 


1 


2m E — P 2 


o y = -~g(t + <x 1 ) + , 2 

2 2 m A g 


o y — --gt + c\t + C 2 . 


(20.70) 


In the last step, we renamed the constants. 
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The analogous procedure with 3S/3/32 = <22 yields Exercise 20.6 

y = —C\X 2 + C 2 X + C3, (20.71) 

i.e., the familiar throw parabola. For the case of the slant throw the Hamilton-Jacobi 
equation may appear clumsy for establishing the equation of motion. A certain advan¬ 
tage of the method shows up in complicated problems, e.g., in the Kepler problem in 
Exercise 20.3. 


20.1 Visual Interpretation of the Action Function 5 

In the preceding problems, the Hamilton-Jacobi differential equation proved success¬ 
ful for establishing the equations of motion, in particular for complex mechanical 
problems. There remains the question about the visual meaning of the action func¬ 
tion S. We consider the motion of a single mass point in a time-independent potential 
and write 


S = So(qi , Pi) - Et , 


where, as already indicated, So (qt , Pi ) describes a spatial field which is time- 
independent. With the labeling qi = x, q 2 = y, q?, = z and p\ = p x , P 2 = P y , P 3 = p z > 
for the momentum components we have according to (19.12) 

3 S 3 So 3 S d So 3 S d So 

12 ^ t \ ^ P V < \ ^ P Z t \ 

ox ox 3y Oy oz oz 

Written as a vector equation, this is 
p = gradS = VSo. 

Since grad S is always perpendicular to the equipotential surfaces of S, we realize that 
in a representation of the S-field by S = constant, the orbits are represented by trajec¬ 
tories orthogonal to this set of surfaces. Accordingly, to a given field S belong all mo¬ 
tions with trajectories perpendicular to the equipotential surfaces of S (S = constant), 
and moreover along each trajectory all motions starting at an arbitrary moment (see 
Fig. 20.2). 



Fig. 20.2. Surfaces S = 
constant. Trajectories are 
dotted 


The time behavior of the S-field can be seen from the representation S = So — Et. 
For t = 0, the surfaces S(qt, Pi) = 0 and So (qi , Pi) = 0 are identical. For t = 1, the 
surface S = 0 coincides with the surface So = E, S = E with So = 2E, etc. This 
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means graphically that surfaces of constant S- values move across surfaces of constant 
So-values, i.e., that surfaces of constant S move through space. The formal meaning 
of S follows from the action integral. One has 

J Ldt = J ( p x dx + p y dy + p z dz — Hdt), 

f }/ds ds ds ds \ 

I Ldt — II — — dx + —— dy + —— dz + —— dt \ — S 2 — S\. 

J J \dx dy dz dt J 

t\ t\ 

S therefore represents an action (energy • time). It is the time integral over the La- 
grangian. The Hamilton principle 8 J Ldt = 0 therefore states that a motion proceeds 
with the boundary condition of minimum action. 


EXAMPLE 


20.7 Illustration of the Action Waves 


To illustrate the action waves, we consider the throw or fall motion in the gravita¬ 
tional field of the earth, where the equation of motion is well known. In analogy to 
Exercise 20.6, we obtain the following Hamilton-Jacobi differential equation: 


^{(s) + G;) + (te) | 


ds 

+ mgz+ — =0. 
dt 


(20.72) 


With the separation ansatz S = S x (x ) + S y (y) + S z (z) — Et, we obtain 

Sx — X Px 5 Sy = yfiy 

up to additive constants, and 

1 f dS A\ z7 ti + Py 0 

+mgz = E --- =p z . 

2m V dz J 2m 


(20.73) 


The quantities and /3 y are separation constants, just like /3 Z . Integration over z 
yields, up to a constant, 


2/2 o /9 

S7 = -mgz) 3/2 . 


(20.74) 


3 gym 

We write the constant fi z as fi z = mgzo and thereby can express the total energy as 


zr £ + Py , 

E = —-- \-mgzo- 

2m 


By insertion, one gets the action function 
2mj2g 


S = x/3 x + y/3 y - 


L (zo -z) 3/2 - ^ 


2 m 


+ mgzo )t, 


(20.75) 


(20.76) 
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and by the familiar scheme the equations of motion, 


Qx = OLx = 

Qy = Oty = 
Qz = — 


$Px X m 
dS /3y 

Wy= y ~m h 

= ~m^2g(zo - z) l/2 
ozo 


— mgt. 


(20.77) 


Among the possible motions of a body in the gravitational field we pick the ensemble 
with p x = 0, P y = 0, zo = 0- From (20.76), we get for the surfaces with So = constant: 


constant =-2^1 (_ z) 3/2. 

These are planes parallel to the x, y -plane. 

The possible trajectories are shown in Fig. 20.3 by dashed vertical straight lines. 
Since the action function is real only for z < 0, there are only such throw conditions 
that ascend up to the plane z = 0 and then return again. In the present example, the 
action waves are planes parallel to the x, y-plane that propagate in z-direction. This 
is easily seen from (20.76) for zo = constant / 0. Any vertical throw up to the height 
zo thus belongs to the same S-field or to the same action wave, respectively. Here it is 
not essential at which space point the throw motion begins. 


Example 20.7 



As a further ensemble of motions, we consider 


Px = 0, zo = 0, 


, 2 m*/2g 

'y = —3—’ 


so that from (20.76) we obtain 


(20.78) 






y = 


2m v / 2g 

3 

3 

2 m^/2g 


{y - (-z) 3/2 l 

So + (—z) 3/2 . 


(20.79) 

(20.80) 


Equation (20.80) represents a Neil or semicubic parabola (y = ax 3 / 2 ) in the y, z-plane. 
The surfaces with ^o = constant are therefore surfaces parallel to the x -axis: F(y,z) = 
0, i.e., cylindrical surfaces intersecting the y, z-plane in a set of Neil parabolas with 
the top on the y-axis (Fig. 20.4). 
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Fig. 20.4. 



/ y 


S n -S- 


With increasing So the tops of the Neil parabolas move in the y-direction. The 
related trajectories are throw parabolas in the y, z-plane which have no velocity com¬ 
ponent along the x-direction and reach their highest point at z = 0 (dashed curves in 
Fig. 20.5). 


Fig. 20.5. Projection onto the 
y,z-plane of the preceding 
Fig. 20.4 



The velocity component in y-direction is the same for all throws: 



In this case, the action waves consist of cylindrical surfaces parallel to the x-axis that 
propagate with increasing time in z-direction. 

The starting point of the motion in the x, y-plane is again arbitrary; the turning 
points of the trajectories are at z = 0. All throw parabolas parallel to the x-axis belong 
to the same action wave; i.e., any throw described by (20.79) can be represented by a 
set of action waves propagating in the z-direction and parallel to the x-axis. 

We see from this example that the simple throw in the gravitational field can be cor¬ 
rectly represented by the Hamilton-Jacobi formalism but becomes hopelessly compli¬ 
cated. This confirms our thesis: Although the Hamilton-Jacobi method contains beau¬ 
tiful formal ideas, it is hardly practicable, too clumsy, and too abstract for physicists. 

EXAMPLE 

20.8 Periodic and Multiply Periodic Motions 

In this example, the peculiarities of periodic motions shall be compiled and extended 
to multiply periodic motions. 3 


3 Here, we follow A. Budo, Theoretische Mechanik, Deutscher Verlag der Wissenschaften, Berlin 


(1956). 
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1. Periodic motions: Here, one distinguishes two kinds, namely, the properly peri¬ 
odic motion , for which 


q,(t + r) = Chit), 

(20.81) 

Pi(t + t) = Pi(t), 

i.e., both the coordinates and the momenta have the same period x. This motion is 
also called libration. Two-dimensional examples are the (nondamped) harmonic os¬ 
cillator or the (nondamped) vibrating pendulum. The phase-space diagram (the phase 
trajectory) is a closed curve (see Fig. 20.6). 

The other type of periodic motion is the rotation. Here one has (e.g., in the two- 
dimensional case) 

p(q + qo) = p(q), (20.82) 

i.e., the momentum takes for q + qo the same value as for q. The coordinate q is 
mostly an angle variable and qo = 2n. One might imagine for example a circulating 
pendulum; in this case q is the pendulum angle. The phase-space trajectory is then not 
closed but periodic with the period qo (see Fig. 20.7). 



Fig. 20.7. Phase-space diagram of the rotation as a periodic motion. The trajectory is open but 
has the period qo- In other words, the momentum p is a periodic function of the coordinate q 
with the period qo 


The limiting case between rotation and libration is called limitation motion. The 
pendulum, which is almost circulating, is an example for this type of motion. The 
coordinate period qo is then qo = 2n as before, but the time period is r = oo. 
(The pendulum then comes to rest in the upper vertical position (unstable point), i.e., 
the function graph terminates at the point go-) If the system is conservative and is 
described by the Hamiltonian H(p,q), we have the equations 


H(p,q) = E , 



(20.83) 


The first equation yields p — p(q, E), i.e., for a given energy E the phase trajectory. 
The second equation is the (reduced) Hamilton-Jacobi equation from which the action 
function (generating function) F 2 (q, P) = S(q, E) can be calculated. If that is done, 
one can calculate the phase-space integral 


J = 




(20.84) 


Example 20.8 


P 



q 

Fig. 20.6. Two-dimensional phase 
diagram of a properly periodic 
motion. A closed phase tra¬ 
jectory occurs, e.g., for a non¬ 
damped vibrating pendulum 
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Here, f means the integration over a closed trajectory in the case of libration, or over 
a full period q\ < q < q\ + qo in the case of rotation. Hence, the phase integral J 
exactly corresponds to the shaded areas in Figs. 20.6 and 20.7 

The phase integral / = J (E) depends only on E and is constant in time, since the 
total energy is constant in time. Hence, (20.84) leads to the relations 

J = J(E) or E = E(J). (20.85) 

As a consequence, the function S(q, E) changes to 

S(q,E ) =► S(q,E(J)) = S'(q,J). (20.86) 

The function S'(q , J) can serve as the generator of a canonical transformation. The 
new momentum is now identified with /, i.e., 

P = J. (20.87) 

The canonically conjugate variable belonging to P = J is denoted by Q = (p. It is also 
called an angle variable and is calculated according to (20.4) as 

3S'(q, J ) 

(p= dJ- (20 ' 88) 

As S' does not explicitly depend on time, the transformed Hamiltonian H' is obtained 
by expressing the original Hamiltonian (20.83) in terms of the transformed variables 

H'((p,J) = E(J). (20.89) 

The Hamilton equations in the new coordinates then read 

. dH' . dE(J) 

Q = - or (p — -= constant, 

dP , U (20.90, 

dH' 

P = -or / = 0. 

9G 

dE(J)/dJ depends only on /, which is constant in time. Hence, cp is also constant in 
time, and then 

Here, the phase constant 8 appears. If we had not selected S'(q, J ) as the generating 
function but rather the complete time-dependent action function 

W(q , /, t) = S'(q , J ) - E(J)t , (20.92) 

the coordinate conjugate to J would be according to (20.88) 

dW(q,J ) dS'(q,J ) dE(J ) # 

dJ dJ dJ 1 

dE(J) 

= <p-—±j 1 t = 8, (20.93) 


i.e., just the phase constant from (20.91). Equation (20.91) states that the angle 
variable (p linearly increases with the time. It is a cyclic coordinate as is evident 
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from (20.89), since the Hamiltonian H\cp, J) = E(J ) does not depend on ip. The 
change of <p during a period r is found from (20.91) to be 


Acp = 



T, 


which can be specified more precisely by means of (20.88). We have 


(20.94) 


A ip = 


3 <p 


3 q 

= -i 

3 J J 


dq = 


9 2 S'(q,J) 
dqdJ 


dq 


3 S'(q,J) 3 J 

- dq = — = 1. 

3 q H 3 J 


(20.95) 


Hence, the angular coordinate increases during a period after which the system returns 
to its initial configuration, exactly by 1. We therefore can state that the motion of the 
system is periodic in ip with the period 1. Combining (20.94) and (20.95) yields 


T 


3 E 

37 


= 1 




3 E 

JJ 



(20.96) 


v is the frequency of the periodic motion. Obviously the complete solution of the 
equations of motion is not needed for calculating v. It is sufficient to express E as a 
function of / and to differentiate with respect to J . This is the advantage of introduc¬ 
ing the action (/) and angle variables (ip). The approach is illustrated in Example 20.2 
for the case of the harmonic oscillator. 


2. Separable multiply periodic systems: We imagine a conservative system with 
/ degrees of freedom, which is described by the / coordinates q \,..., qp and is 
separable. This means that the solution of the reduced Hamilton-Jacobi equation 


f 3 S 3 S 


(20.97) 


Example 20.8 


can be written in the form 


S(qu...,qf,E,P 2 ,...,p f ) 

= Si(qi\E,fo,...,Pf) + --- + Sf(q f ;E,fo,...,Pf). (20.98) 

The / integration constants 

£, £ 2 ,(20.99) 

characterize the constant momenta P \,..., Pf. If the Hamiltonian decomposes into 
a sum of terms H(qi,pi ), only one constant appears in the functions Sk(qk\ E , 
f $2 in (20.98); the functions then have the form Sk(qk, Pk )—see (20.10). 

When can we classify a motion as periodic? The answer is simple: If any pair of 
conjugate variables (qt, pi) always behaves as discussed in the first part of this ex¬ 
ample, the motion is periodic. More precisely: The projection of the phase trajectory 
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onto each qi,pi -plane of the phase space must be either a libration or a rotation, to 
guarantee the periodicity of the entire motion of the system. 

The procedure is analogous to that outlined in the first section. First one defines the 
action variables 

Ji = (J) pidqi = j) dqi 

= Ji(E, fc, ■■■, Pf), i = \,..., f. (20.100) 

They are constant in time, since E, £ 2 , ■ • •, £/ are constant. The / equations (20.100) 
can be solved for E, £ 2 ,..., £/ and yield 

E = E(J u ...,J f ), 

Pi = , 7/), 

(20.101) 


By inserting (20.101) into (20.98), we obtain 

Si( qi , E(J k ), £2(4), • • •, £/(/*)) = S\ qi , 7i, • • •, 7/). (20.102) 

This is a generating function with the constant momenta 

Pi = Ji. (20.103) 

The relation (20.102) is fully analogous to the relation (20.86), and (20.103) corre¬ 
sponds to (20.87). The canonically conjugate angle variables result—like (20.88) — 
from 

dS' J^dS' t (q k ,Ji,...,Jf) 

w = ?rr = E V; ’ = ( 20 . 104 ) 

97/ 97/ 

Among the canonical variables (g/, F/) = (<p/, 7/) is the Hamiltonian 

H'(<pi,Ji) = E(Ji), (20.105) 

since the Hamiltonian is independent of time (see (20.83)). From this follow the 
Hamilton equations 

dH' dE(J k ) 

c pi =-=-= constant = v/, 

9P S ^ (20.106) 

dH' 

Ji =-= 0; 

9<Pi 

hence, 


C Pi = Vit + 8i, 

Ji = constant. 


(20.107) 
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We are now interested in the change of the angle variables (pi over a period (full rev¬ 
olution or back-and-forth motion of a coordinate qi with the remaining coordinates 
kept fixed). It is given by 


A m 


/ 3 <Pi , 

(b —dq k = 

J dqk 


i 


d 2 S' 

dJidqk 


dqk 



(20.108) 


According to (20.107), 


Example 20.8 


A k<Pk = v k r K , (20.109) 

if z k is the “vibration time” (time interval of the period) of q k . A comparison 
of (20.109) and (20.108) yields 


vjfcr* = l. (20.110) 

Thus, 

v k = — (20.111) 

n 

obviously are the frequencies of the ^-motion. In other words, according to (20.106) 
the (fundamental) frequency v k for the coordinate q k is v k = dE(J \,..., Jf)/dJ k . 

Equations (20.104) can also be inverted, which yields the original coordinates q n 
with 


Q k = Q k (<Pl,---,Vf)> k = l ,...,/ (20.112) 

as functions of the new angle variable (pi. When increasing <pi by A (pi = 1 (keeping 
the values of all other <p k with k ^ i fixed), qi (and only this!) must run through a 
period. This follows from (20.108): If q k (with k^i) ran through a period when <pi 
changes to (pi + A (pi = <p t + 1, then according to (20.108) the variable (p k also should 
increase by A(p k = 1. But this shall not occur by assumption. Therefore if (pi increases 
to (pi + 1, qi changes as follows: 


<Pi “► Vi + 1 > 

qi -> qi for libration, 

Qi —> qi + qiO for rotation. 

For a libration, qi is periodic; for a rotation, 
Qi ~ Vi QiO 

is a periodic function of (pi. Actually, 


(iPi Vi + 1, 

Qi ~~ ViQiO Qi H - QiO — (.Vi T 1)^0 = Qi — ViQiO- 


(20.113) 


(20.114) 


(20.115) 
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We therefore can expand the separation coordinates qi (for libration) or qi — cpiqu) (for 
rotations) in a Fourier series and write 


qi -m/oOpiO), 


+00 

+'V 2;r¥,i 


n=—oo 


+00 

E 


a (i) e i2nn{vit+8i) 


n=—o o 


(20.116) 


where 

l 

■■■, <Pi-l,<Pi+l, ...,<Pf) = J qi(<pu ■■■, (pf)e~ l2Kn<Pi d<pi. (20.117) 
0 

The Fourier coefficients (cp \,..., cpi-i, (pi +\,..., (pf) in general still depend on all 
angle variables, except for (pi. 

We now imagine other variables xi which describe the system and are useful for 
certain problems. They shall unambiguously depend on the qi(t) and therefore are 
also functions of the time. Then we can write 


0 i2n(ni(pi+-+nf(p f ) 


xi(qi(t),...,q f (t)) 

+00 +00 

= E - E 

n\=—oo rif=—o o 

00 

_ y ' ^(/) -h nfVf)t+(8 n ^+...+8 n j.)] ^^ 

n\ f=—o o 


(20.118) 


In the second step, we used cpi =vrf + 8 j . The coordinates jt/ can be represented only 
by a multiple Fourier series. Equation (20.118) now suggests that the motion xi(t) is 
in general not periodic in time. For example, if t increases by At = \/v\ , the first 
exponential factor in (20.118) does not change because e l2jrn i v i At = _ 

e i27zni _ the other exponential factors in (20.118) vary. The system is therefore 
called multiply periodic in the coordinates jt/. If the system is simply periodic , the 
frequencies v\,... ,vy must be correlated by / — 1 equations of the type 

Ci ivi + c i2 v 2 + • • • + CifVf =0, i = 1 ,...,/- 1. (20.119) 

These are / — 1 equations for the / unknown quantities vi,..., Vf. It is evident that 
besides v/, also vv/ (v an arbitrary factor) is also a solution of (20.119). Let v/ = 
ni/mp thus, the v/ can be represented by the fraction ni/mi. Then 

/ n i 

y. = (m\m 2 • • • m f)vi = (m\ • • • m /)— (20.120) 

mi- 

are also solutions of (20.119). Hence, the v- are integers. Since the solutions of 
(20.119) can be determined only up to a common factor v, the general solution reads 


v/ =aiV, 


( 20 . 121 ) 
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where the at = (m\m 2 • • • m f)rii/mi are integers, and v is a common factor. Thus, the Example20.8 

system is periodic if and only if all frequencies are commensurable. The fundamental 

frequency vo is then the largest common divisor of all frequencies v \,..., Vf. If there 

exist only s (with s < f — 1) relations of the form (20.119), s frequencies can be 

rationally expressed by the remaining ones. The system (the motion) is then called 

s-fold degenerate or (/ — s)-foldperiodic. Special cases are 

• s = 0: the motion is /-fold periodic or nondegenerate; 

• s = f — 1: the motion is single-periodic or fully degenerate. 


20.2 Transition to Quantum Mechanics 


In the last chapters, we have emphasized the formal aspects of mechanics. Although 
for solving practical problems sometimes no advantages could be achieved, the in¬ 
sights in the structure of mechanics provided by the Hamiltonian formalism con¬ 
tributed essentially to the development of quantum mechanics. For example, the con¬ 
cept of the phase integral was of fundamental importance for the transition to quantum 
mechanics. The first clear formulation of the quantum hypothesis consisted of the re¬ 
quirement that the phase integral take only discrete values ; hence, 


J = 


pdq=nh , n = 1,2,3,..., 


( 20 . 122 ) 


where h is Planck’s action quantum, which has the value h = 6.6 • 10 -34 J s. We again 
consider the case of the harmonic oscillator. In Example 20.2, we evaluated the phase 
integral 


J = 2ttE 



(20.123) 


v = (1/27 t)^/kjm was the frequency. With the quantum hypothesis, we then obtain 


E n = nhv. 


(20.124) 


Thus, the quantum hypothesis leads to the conclusion that the vibrating mass point 
can take only discrete energy values E n . For the motion, this means that only certain 
trajectories in the phase space are allowed. We therefore get ellipses for the phase- 
space trajectories (compare Example 20.2), whose areas (the phase integral) always 
differ by the amount h. In this way, the phase space acquires a grid structure that is 
defined by the allowed trajectories. 

Each trajectory corresponds to an energy E n . In a transition between two trajec¬ 
tories the mass point receives (or releases) the energy E n — E m = (n — m)hv. The 
smallest transferred amount of energy is given by hv. 


P 



Fig. 20.8. In quantum me¬ 
chanics, the phase-space tra¬ 
jectories of the harmonic os¬ 
cillator are ellipses that differ 
by an area of h 
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Since the action quantum h is so small, the discrete structure of the phase space 
is significant only for atomic processes. For macroscopic processes the trajectories 
in the phase space are so dense that one can consider the phase space as a contin¬ 
uum. The energy quanta hv are so small that they have no meaning for macroscopic 
processes. For example, the energy emitted in a transition in the hydrogen atom is 
hv = 13.6eV (electron volt). Expressed in the (macroscopic) unit of Watt seconds 
hv = 2 • 10 -18 Ws. The quantum hypothesis was confirmed by the explanation of the 
spectra of radiating atoms. 


EXERCISE 


20.9 The Bohr-Sommerfeld Hydrogen Atom 


Problem. At the beginning of the development of modern quantum mechanics, 
N. Bohr and A. Sommerfeld formulated a “quantization prescription” for periodic 
motions. Accordingly, only such trajectories in phase space are admitted for which 
the phase integral 

<j) p a dq a = n a h, n a =0,1,2,... (20.125) 

is a multiple of Planck’s action quantum h = 6.626 • 10 -34 Js. The integral extends 
over a period of motion. q a and p a are the generalized coordinates and the canonically 
conjugate momenta, respectively. 

(a) Write the Lagrangian, the Hamiltonian, the Hamilton equations, and the constants 
of motion for a particle in the potential v(r) = —e 2 /r. 

(b) Calculate the bound energy states of the hydrogen atom from the condi¬ 
tion (20.125). 


Solution, (a) The Lagrangian is L = T — V = (1/2 )mv 2 + e 2 /r. The Hamiltonian 
then follows as 

.3 L 1 9 e 2 1 .9 1 9 . 9 e 2 

H = x a —:- L = -mv z -= -mr z H— mr Z (p z - (20.126) 

3 JCty 2 r 2 2 r 

CM W 


in polar coordinates. The canonical momenta are 

3L 2 • n 3 L . 

Pep = — = mr (p = L and p r = — = mr. 
3 cp dr 

C is the angular momentum of the particle. 

Then 


H(p,q) = ^ + -^--. 

2m 2 mr z r 

Constants of motion are 

(i) H = E, since H(q, p) does not explicitly depend on the time; and 

(ii) py = C, since (p is a cyclic variable. 


(20.127) 


(20.128) 
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C represents the constant angular momentum. The Hamilton equations read 


dH „ 


dH 


e2 , pi 

Pr ~ ~ - 1 - 3 

or r mr J 


dH 

<P= = -2’ 

op^ mr z 

. dH p r 


dp r m 

(b) The quantization conditions for the angular motion are 

2n 

= (j) Pcpdcp = J Cdcp = 2 tt£ 


Ih 


c = ih , 


ft=—, / = 0,1, 2,..., 

2n 


(20.129) 


(20.130) 


i.e., the orbital angular momentum can take only integer multiples of H. For the radial 
motion , the phase integral equals 


kh = 


= (j) p r dr = 2 J y 2m + 


e 2 \ C 2 

— )- y dr, k — 0, 1, 2,.. 

r J r z 


(20.131) 


The limits of integration are determined from the condition 
Pr =0; 
thus, 

C 2 


2 e 
r m + jr, 


2m E 
e 2 

=F 


2 E 4m E 

The integral in (20.131) is of the type 


= 0, 

with A = —4m(2EC 2 + me 4 ). 


/ 


\/ ar 2 + hr + < 


dr = 


=/ 




Jr, 


r Jr 

and one easily verifies by differentiation that 

Here, A(r) = ar 2 + br + c. Furthermore, 
dr 


dr 




[dr 1 (2 ar + b\ 

/ , =-== arcsin — , for a < 0 (since E 

J yfitjr) V V^A / 


< 0 ) 


and 


/ 


Jr 




1 (br 4~ 2c\ 

-== arcsin 


'—c 


r \/—a y 


) for c < 0 and A < 0. 


(20.132) 


(20.133) 


(20.134) 


(20.135) 


(20.136) 


Exercise 20.9 
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Exercise 20.9 


Here, 


A = 4 ac — b 2 . 


(20.137) 


This leads to 


/ 


dr = V ar 2 + br + c — 


2*J^a 


( 2 ar + 2Z? 
arcsm — t 

V 


c (br + 2c\ 

H —-= arcsin I 
J-c V r^ 


) 


for a < 0, c < 0, A < 0. In our case (see (20.131)), 

a = 2mE, b = me 2 , c = —C 2 , 

A = —4m(2EjC 2 + m^ 4 ). 


(20.138) 


(20.139) 


For the integral, one gets (with E < 0) 
\/2mEr 2 + 2me 1 r — C 2 


f 


-dr 


= V 2 mEr 2 + 2me 2 r — C 2 

me 2 / 4mEr -\~2me 2 

H- . arcsin -; - - 

V-2 mE V V^A 


\ f2me 2 r — 2C 2 \ 

J — £arcsin^- ^ —J. (20.140) 


Insertion of the integration limits yields 


2n 2 me 4 


— 2ixC = kh. 


(20.141) 


If one defines the “principal quantum number” n = l + k = 0, 1,2,..., the formula 
for the binding energy reads 


E 


n 


4 

me* 
2 h 2 n 2 


(20.142) 


This formula for the discrete energy levels in the hydrogen atom agrees exactly with 
the quantum mechanical result. Only the value n = 0, which was allowed in this con¬ 
sideration, is excluded in the quantum mechanical approach. The underlying classi¬ 
cal picture (electron moves in an elliptic orbit with the eccentricity £ = y/l — ( l/n ) 2 ) 
leads however to contradictions and must be modified in quantum mechanics. Because 
n — l +k, the energy levels with n — 1, 2 ,..., are twofold, threefold, ..., degenerate. 


EXERCISE 


20.10 On Poisson Brackets 

Problem. If the functions E and G depend on the coordinates q a , the momenta p a , 
and the time t , the Poisson bracket of F and G is defined as follows: 
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y^fdF dG 

[F,G] = £ 


9F dG 

V 9 ^ 9 A* 9/><* 9# a 


Show the subsequent properties of this Poisson bracket: 

(a) [F,G] = -[G,Fl 

(b) [F l + F 2 ,G] = [F l ,G] + [F 2 ,Gl 

dF 

(c) [F,q r ] = - —, and 

dF 

(d) [F,p r ] = —. 

oq r 

Solution, (a) 


E [ dF dG dF dG\ x~^( dG dF dG dF 

I-1 = — y (- 

\dq a dp a dp a dq a ) \dq a dp a dp a dq a 


,/dF dG 
\dq a 

= -[G,F]. 


We see that the Poisson bracket is not commutative. 
(b) 


[Fi + F 2 , G] = J2 


9(Fi + F 2 ) dG d(F\ + F 2 ) dG 


dqa dp a 

dF\ dG dFi dG 


dpa dq a 


or 1 dG or\ dG \ ^ I dt 2 dLr dt 2 
dq a dp a dp a dq a / ^ \ da„ dv„ do™ 


dF2 dG dF2 dG 


dqa dpa dp a dq a 


= [F\, G] + [F 2 , G]. 


Therefore, the Poisson bracket is distributive. 

(c) 


[F, q, \ = 

El 

a 

( dF dq r 
\ dqa dpa 

dF dq r 
dpa dq a 

dq r _ 
dPa 

0 

=> [ F, qA 

II 

i 

Here, 8 ra is 

the Kronecker symbol: 

8ra = 1 

for 

r = a, 


&ra = 0 

for 

r/a. 


(d) Analogously, we find 


[ F , Pr \ = 

El 

a 

(dF dp r 
\ dq a dp a 

dF dp r 
dpa dq a 



dF 

dp r 



dF 
dq r ’ 


since dp r /dq a = 0. 

We shall meet the rules on Poisson brackets in quantum mechanics again, since the 
transition to quantum mechanics (the so-called canonical quantization) is performed 


Exercise 20.10 
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Exercise 20.10 


by the transition to operators and by replacing the Poisson bracket [, ] by the commu¬ 
tator (1/ i h ){, }, where 


{A,B}_=AB - BA. 


If we form, e.g., [qi, pj ], we obtain 


r n dqi dPj 

^' dq a dp a 


dPa dq a ) 


(20.143) 


In the canonical quantization, one passes from the classical momenta pj to operator 
momenta p~j , and from the classical Poisson bracket [, ] to the quantum mechanical 
Poisson bracket (1 /ih){, }_. Thus, in the canonical quantization one substitutes the 
relation (20.143) by 


{qi,Pj}~ =ih8ij. 

Equation (20.144) is satisfied if = —ihd/dqj\ 
d 


(20.144) 


{<B,Pj}- = -ih\qi 


dqj J _ 


where the commutator operates on a function f(q \,..., q a )- For example, 


-ih\ ,q t 
dqj 


f(qi, ...,q a ] 


d d 

dqj dqj 


= —ih 

= -ih8ij • f(qi,...,q a ), 


> Qa) 


where the product rule was used and thus (20.144) is verified. The rules for the quan¬ 
tum mechanical commutators are identical with those for the Poisson brackets. One 
might say that quantum mechanics is another algebraic realization of the Poisson 
brackets. As will be seen in quantum mechanics, this conclusion is premature and 
in this form not correct. 


EXERCISE 


20.11 Total Time Derivative of an Arbitrary Function Depending on q, p, and t 


Problem. Let H denote the Hamiltonian. Show that for an arbitrary function depend¬ 
ing on qi , pi , and t we have 


df 

dt 


df 

+ u,m. 

dt 
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Solution. The total differential of the function f(pi,qt,t) reads 


df 

df = —dt + 
dt 


rv v 


df 9/ , 

—dq a + ~ — dp a 

dqa OP a 




df 

dt 



By means of the Hamilton equations 
dH _ . dH _ . 

t ; = Qot » — = —Pa, 

uP a oq a 

we can rewrite (20.146) as 

df df x-V 3/ dH df dH\ df 

f f + a - ~f— a - ) = + [/> ^ ] 

£ 9 /^ ^ op a dp a oC[(% J ut 


(20.145) 

(20.146) 


(20.147) 


Thus, the Poisson brackets enter automatically. Equation (20.147) reminds us even 
more of the results of quantum mechanics than the analogies of the last problem. In 
quantum mechanics we shall find the following expression for the time derivative of 
an operator F: 


dF 

dt 


dF 1 ^ ^ 
— + -{F,H}. 
dt in 


(20.148) 


where H represents the Hamiltonian operator of the quantum mechanical problem. It 
is, e.g., of the form 

- - ^ _ d 

H — H(x , p ) with p — —in — 

dx 

and depends in general on the coordinates, momentum operators, and possibly even 
further quantities, e.g., spin. 


Exercise 20.11 




Extended Hamilton-Lagrange Formalism 


21.1 Extended Set of Euler-Lagrange Equations 

The conventional formulation of the principle of least action (Hamilton principle, 
see (18.25)) is based on the action functional S[qj(t) ], defined by 



( 21 . 1 ) 


l a 


with L(qj,qj,t ) denoting the system’s conventional Lagrangian, and (q\(t),..., 
q n (t)) the set of configuration space variables as functions of time. In this formu¬ 
lation, the independent variable time t plays the role of the Newtonian absolute time. 
The clearest reformulation of the least action principle for relativistic physics is ac¬ 
complished by treating the time t(s) = qo(s)/c —just like the configuration space vari¬ 
ables qj(s) —as a dependent variable of a newly introduced independent variable, s. 
The idea behind is to place all space-time variables on equal footing. The action func¬ 
tional (21.1) then rewrites in terms of an extended Lagrangian L\ 



( 21 . 2 ) 


As the action functional (21.2) has the form of (21.1), the subsequent Euler-Lagrange 
equations that determine the particular path (qj(s), t(s )) on which the value of the 
functional S[qj(s ), t(s)] takes on an extreme, adopt the customary form, 



(21.3) 


Here, the index /x = 0,..., n spans the entire range of extended configuration space 
variables. In particular, the Euler-Lagrange equation for t(s) writes 



The equations of motion for both qj (s) and t(s) are thus determined by the extended 
Lagrangian L\. The solution qj (t) of the Euler-Lagrange equations that equivalently 
emerges from the corresponding conventional Lagrangian L may then be constructed 
by eliminating the evolution parameter s . 

As the actions, S and S\ , are supposed to be alternative characterizations of the 
same underlying physical system, the action principles 8 S = 0 and 8 S\ =0 must hold 
simultaneously. This means that 
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which, in turn, is assured if both integrands differ at most by the s-derivative of an 
arbitrary differentiable function T(qj , t) 


dt dT 

L— = L i +-. 

ds ds 


Functions T{qj , t) define a particular class of point transformations of the dynamical 
variables, namely those ones that preserve the form of the Euler-Lagrange equations. 
Such a transformation can be applied at any time in the discussion of a given La- 
grangian system and should be distinguished from correlating L\ and L. We may thus 
restrict ourselves without loss of generality to those correlations of L and L i, where 
T = 0. In other words, we correlate L and L\ without performing simultaneously 
a transformation of the dynamical variables. We will discuss this issue in the more 
general context of extended canonical transformations in Sect. 21.3. The extended 
Lagrangian L\ is then related to the conventional Lagrangian, L, by 


( dqj 

dt\ 

T ( dC U 0 

i dt 

dq t _ 

dqi /ds 


ds ) 

- \ qj ' dt E 

Us' 

dt 

dt/ds 


(21.4) 


The derivatives of L\ from (21.4) with respect to its arguments can now be expressed 
in terms of the conventional Lagrangian L as 


3Li 3 L dt 

dqi dqi ds ’ 

3Li 3 L dt 

3 1 3 1 ds ’ 


3Li 


3L 


»(f) Ht) 




3Li 

9 ( 1 ) 


= L ~E 


3 L dqi 


dt 


(21.5) 

( 21 . 6 ) 

(21.7) 

( 21 . 8 ) 


With qo = ct, (21.7) and (21.8) yield for the following sum over the extended range 
/x = 0 ,..., n of dynamical variables 


n 


E 


3Li dq /x 

9 ( 5 )^ 


n 


l -E 

1=1 


3 L dqi 



dt 

ds 


+E 


3 L dqi 

9(W)^ 


= L\. 


The extended Lagrangian L\ thus satisfies the constraint 


n 

L 1-E 


[ 1=0 


dL\ dqfi 

9 ( 5 )^ 


= 0 . 


(21.9) 


The correlation (21.4) and the pertaining condition (21.9) allows two interpretations, 
depending on which Lagrangian is primarily given, and which one is derived. If the 
conventional Lagrangian L is the given function to describe the dynamical system in 
question and L\ is derived from L according to (21.4), then L\ is a homogeneous form 
of first order in the n + 1 variables dqo/ds ,..., dq n /ds. This may be seen by replac¬ 
ing all derivatives dq^/ds with a x dq^/ds, a e R in (21.4). Consequently, Euler’s 
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theorem on homogeneous functions states that (21.9) constitutes an identity for L\. 
The Euler-Lagrange equation involving dtIds then also yields an identity , hence, we 
do not obtain a substantial equation of motion for t(s). In this case, the parameteri¬ 
zation of time t(s) is left undetermined—which reflects the fact that a conventional 
Lagrangian does not provide any information on a parameterization of time. 

In the opposite case, if an extended Lagrangian L\ is the primary function to de¬ 
scribe our system, then L\ is no longer a homogeneous function, in general. In that 
case, (21.9) no longer establishes an identity but (21.9) furnishes a constraint function 
for the system. Lurthermore, the Euler-Lagrange equation involving dt/ds then yields 
a non-trivial equation of motion for t(s). The conventional Lagrangian L may then be 
deduced from (21.4) by means of the constraint function (21.9). 

To summarize, by switching from the conventional variational principle (21.1) to 
the extended representation (21.2), we have introduced an extended Lagrangian L\ 
that additionally depends on dt (s)/ds . Due to the emerging constraint function (21.9), 
the actual number of degrees of freedom is unchanged. Geometrically, the system’s 
motion now takes place on a hyper-surface, defined by (21.9), within the tangent bun¬ 
dle f(Mxl) over the space-time configuration manifold MxM. This contrasts with 
the conventional, unconstrained Lagrangian description on the time-dependent tangent 
bundle (TM) x M. 

EXAMPLE 

21.1 Extended Lagrangian for a Relativistic Free Particle 

As only expressions of the form JA q 2 — c 2 t 2 are preserved under the Lorentz group, 
the conventional Lagrangian for a free point particle of rest mass mo, given by 



( 21 . 10 ) 


is obviously not Lorentz-invariant. Yet, in the extended description, a correspond¬ 
ing Lorentz-invariant Lagrangian L\ can be constructed by introducing s as the 
new independent variable, and by treating the space and time variables, qj(s) and 
qo(s) = ct(s ), equally. This is achieved by adding the corresponding derivative of the 
time variable t(s). 



( 21 . 11 ) 


The constant third term has been defined accordingly to ensure that L\ converges 
to L m in the limit dt/ds —> 1. Of course, the dynamics following from (21.10) and 
(21.11) are different —which reflects the modification our dynamics encounters if we 
switch from a non-relativistic to a relativistic description. With the Lagrangian (21.11), 
we obtain from (21.9) the constraint 



( 21 . 12 ) 
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Example 21.1 


As usual for constrained Lagrangian systems, we must not insert back the constraint 
function into the Lagrangian prior to setting up the Euler-Lagrange equations. Phys¬ 
ically, the constraint (21.12) reflects the fact that the square of the four-velocity vec¬ 
tor is constant. It equals — c 2 if the sign convention of the Minkowski metric is de¬ 
fined as ?7 /xv = = diag(—1, +1, +1, +1). We thus find that in the case of the La¬ 

grangian (21.11) the system evolution parameter s is physically nothing else than the 
particle’s proper time. In contrast to the non-relativistic description, the constant rest 
energy term — ^ me 2 in the extended Lagrangian (21.11) is essential. The constraint 
can alternatively be expressed as 


ds 

dt 


N 


i \ 2 -i 

'■ 3 U IT) =r ' 


i=1 


which yields the usual relativistic scale factor, y . The conventional Lagrangian L that 
describes the same dynamics as the extended Lagrangian L\ from (21.11) is derived 
according to (21.4) 



= Li 


\Vj> 


dt \ ds 


dt L t 

ds ’ ds J dt 


= -me 


2 1 v ^/ dqi \ dt dt ds 

c 2 “V dt ) ds ds dt 


= --moc 


ds dt 
dt ds 


,_ ( dm 

c 2 ^\dt 


— —moc 


N 


,_ f dqt 

c 2 ^\dt 


(21.13) 


We thus encounter the well-known conventional Lagrangian of a relativistic free 
particle. In contrast to the equivalent extended Lagrangian from (21.11), the La¬ 
grangian (21.13) is not quadratic in the derivatives of the dependent variables, qj(t). 
The loss of the quadratic form originates from the projection of the constrained 
description on the tangent bundle T (M x M) to the unconstrained description on 
(T M) x M. The quadratic form is recovered in the non-relativistic limit by expanding 
the square root, which yields the Lagrangian L m from (21.10). 


EXAMPLE 


21.2 Extended Lagrangian for a Relativistic Particle in an External Electromag¬ 
netic Field 

The extended Lagrangian L\ of a point particle of rest mass mo and charge f in an 
external electromagnetic field that is described by the potentials <fi(qj , t) and A(qj , t) 
is given by 



i 


(21.14) 
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The associated constraint function coincides with that for the free-particle Lagrangian 
from (21.12) as all terms linear in the velocities drop out calculating the differ¬ 
ence in (21.9). Similar to the free particle case from (21.13), the extended La¬ 
grangian (21.14) may be projected into (T M) x M to yield the well-known conven¬ 
tional relativistic Lagrangian L 


L 




-mQC 


N 


! yV —tL 

c 2 ^\dt 




(21.15) 


Again, the quadratic form of the velocity terms is lost owing to the projection. 

For small velocities dqj/dt , the quadratic form is regained as the square root in 
(21.15) may be expanded to yield the conventional non-relativistic Lagrangian for a 
point particle in an external electromagnetic field, 


/ dqj \ 1 yr-^f dqi 

L A q j' dt' i 9 w °L( ,, t 


dt 


f a dm c. , 2 

- * M—rr - £0 - me . 




dt 


(21.16) 


Significantly, this Lagrangian can be derived directly , hence without the detour over 
the projected Lagrangian (21.15), from the extended Lagrangian (21.14) by letting 
dt/ds —> 1. 

Comparing the Lagrangian (21.16) with the extended Lagrangian from (21.14), 
we notice that the transition to the non-relativistic description is made by identify¬ 
ing the proper time s with the laboratory time t = qo/c. The remarkable formal sim¬ 
ilarity of the Lorentz-invariant extended Lagrangian (21.14) with the non-invariant 
conventional Lagrangian (21.16) suggests that approaches based on non-relativistic 
Lagrangians L m may be transposed to a relativistic description by (i) introducing the 
proper time s as the new system evolution parameter, (ii) treating the time t(s) as 
an additional dependent variable on equal footing with the configuration space vari¬ 
ables q(s )—commonly referred to as the “principle of homogeneity in space-time”— 
and (iii) by replacing the conventional non-relativistic Lagrangian L m with the cor¬ 
responding Lorentz-invariant extended Lagrangian L i, similar to the transition from 
(21.16) to (21.14). 


Example 21.2 


21.2 Extended Set of Canonical Equations 

The Lagrangian formulation of particle dynamics can equivalently be expressed as a 
Hamiltonian description. The complete information on the given dynamical system is 
then contained in a Hamiltonian H , which carries the same information content as the 
corresponding Lagrangian L. It is defined by the Legendre transformation 

i=\ 

with the canonical momenta pt being defined by 
dL 

Pi = h¥Y 


(21.17) 
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Correspondingly, the extended Hamiltonian H\ is defined as the extended Legendre 
transform of the extended Lagrangian L \ as 


n 

H\ (q h p h t,e) = Y2 

i=1 



dt 
e — 
ds 


Li 




(21.18) 


Herein, — e denotes the conjugate quantity of time t. Corresponding to the con¬ 
ventional formalism, we assume the extended Lagrangian L\ to be regular , which 
means that the Hesse matrix d 2 Li/\d(dq ll /ds)d(dq v /ds)\ be invertible. We know 
from (21.7) that for i = 1,..., n the momentum variable pt is equally obtained from 
the extended Lagrangian L i, 


Pi = 


dL i 

h¥>' 


(21.19) 


This fact ensures the Legendre transformations (21.17) and (21.18) to be compatible. 
For the corresponding definition of p o, we must take some care as the derivative of L\ 
with respect to dt/ds evaluates to 


dL i 

HS) 


= l ~J 2 


dqt dL 


et * 




The momentum coordinate po that is conjugate to qo = ct must therefore be defined 
as 


Po( s ) = e(s) = H(qj(s), Pj(s),t(s)), (21.20) 

with e(s) representing the instantaneous value of the Hamiltonian H at s, but not the 
function H proper. This distinction is essential as the canonical coordinate po must 
be defined—like all other canonical coordinates—as a function of the independent 
variable only. The reason is that the q^, p M with /r = 0,..., n depict the coordinates 
pertaining to the base vectors that span the (symplectic) extended phase space. We 
may express this fact by means of the comprehensible notation 

»«=*<')» ew =-s, w ' <2i2i) 

°\ds) 

The constraint function from (21.9) translates in the extended Hamiltonian description 
simply into 

pj(s),t(s), e(s)) = 0. (21.22) 


This means that the extended Hamiltonian H\ directly defines the hyper-surface on 
which the classical motion of the system takes place. The hyper-surface lies within the 
cotangent bundle T*(M x M) over the same extended configuration manifold MxR 
as in the case of the Lagrangian description. Inserting (21.19) and (21.21) into the 
extended set of Euler-Lagrange equations (21.3) yields the extended set of canonical 
equations, 


dPjM 

ds 


dq /x _ dff 
ds dp^ ' 


d Hi 


(21.23) 
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The right-hand sides of these equations follow directly from the Legendre transforma¬ 
tion (21.18) since the Lagrangian L\ does not depend on the momenta p M and has, 
up to the sign, the same space-time dependence as the Hamiltonian H\ . The extended 
set is characterized by the additional pair of canonical equations for the index /x = 0, 
which reads in terms of t (s) and e(s) 


de dH\ dt dH\ 

ds 3 1 ’ ds de 


(21.24) 


In contrast to the total time derivative of the Hamiltonian H(qj, pj,t ), the total s 
derivative of the extended Hamiltonian H\(q v , p v ) always vanishes. Calculating the 
total s derivative of H\ , and inserting subsequently the extended set of canonical equa¬ 
tions (21.23), we find 


dH\ 

ds 


dHi dq/i + 

^ 3 q u ds 
11=0 L 


dH\ dp ij, 
dPfjL ds 



3 Hi 3 Hi 
dq/i dpii 


3Hi 3Hi’ 

dPii dqii _ 


Formally, an extended Hamiltonian H\ (p v , q v ) = const, thus describes an autonomous 
Hamilton system, hence a system that does not explicitly depend on its independent 
variable. 

By virtue of the Legendre transformations (21.17) and (21.18), the correlation 
from (21.4) of extended and conventional Lagrangians is finally converted into 

H l (q j ,p,t,e) = (H(q j ,p,t)-e)^- s , (21.25) 

as only the term for the index /x = 0 does not cancel after inserting (21.17) and (21.18) 
into (21.4). 

The conventional Hamiltonian H is defined as the particular function whose 
value coincides with the extended phase-space variable e. In accordance with (21.20) 
and (21.22), we thus determine H for any given extended Hamiltonian H\ by solving 
H\ = 0 for e. Then, H emerges as the right-hand side of the equation e = H. 

In the converse case, if merely a conventional Hamiltonian H is given, and H\ is set 
up according to (21.25), then the canonical equation for dt /ds yields an identity , hence 
allows arbitrary parameterizations of time. This is not astonishing as a conventional 
Hamiltonian H generally does not provide the information for an equation of motion 
for t(s). 


EXAMPLE 


21.3 Trivial Extended Hamiltonian 

The trivial extended Hamiltonian H\ is defined by 

Hi(qj, pj,t,e) = H(qj, pj,t) - e. 

According to (21.24), the canonical equation for dt/ds is obtained as 

dt dH\ 

ds de 


(21.26) 
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Up to arbitrary shifts of the origin of our time scale, we thus identify t(s) with s. As all 
other partial derivatives of H\ coincide with those of H , so do the respective canonical 
equations. The system description in terms of H\ from (21.26) is thus identical to the 
conventional description by a Hamiltonian H and does not provide any additional 
information. 


EXAMPLE 


21.4 Hamiltonian of a Free Relativistic Particle 

We present the systematic approach how to set up the Lorentz-invariant extended 
Hamiltonian H\ v for a given non-invariant conventional Hamiltonian H m . For the 
case of a free particle of rest mass mo, the non-relativistic Hamiltonian that includes 
the particle’s rest energy is given by 

v 2 

H m (p) = --1- m 0 c 2 . (21.27) 

2m o 

Herein, p denotes the 3-component vector of particle momenta, p = (p \, p 2 , pf). The 
equivalent extended Hamiltonian (21.26) that yields the same dynamics in terms of the 
subsequent canonical equations (21.23) is then 

p 2 2 

#i,nr (P,e)=- - e + m 0 c 2 , (21.28) 

2 m o 

in conjunction with the general side condition for extended Hamiltonians, 
H\ m (q, P , t, e) = 0. As solely expressions of the form q 2 — c 2 t 2 and p 2 — e 2 /c 2 
are maintained under Lorentz transformations, (see Example 21.18), the Hamiltoni¬ 
ans (21.27) and (21.28) are obviously not Lorentz invariant. In the description of ex¬ 
tended Hamiltonians , the corresponding Lorentz-invariant form of (21.28) can easily 
be constructed 


H u (p,e) 




i 2 

+ -moc . 


(21.29) 


The constant term was adjusted to preserve the relation e = moc 2 for p = 0. The side 
condition H\ = 0, which represent an implicit function , now yields the relativistic 
energy-momentum correlation 


2 2 2 , 2 4 

e = p c + m^c . 


(21.30) 


Although p and e denote formally independent canonical variables, only those com¬ 
binations of p and e have physical significance that satisfy (21.30). Of course, 
the canonical equations that follow from (21.29) are different from those following 
from (21.28). This reflects the modification that a system’s description encounters 
if we switch from a non-relativistic to a relativistic viewpoint. The extended set of 
canonical equations (21.23) emerging from the extended Hamiltonian (21.29) is 

d#i, r _ dpt _ Q d#i, r _ dq t _ p t 

dqi ds ’ 3 pi ds mo ’ 








21.2 Extended Set of Canonical Equations 


423 


= J_Hu 

3 1 ds ’ de 


dt 


ds moc 2 


In conjunction with the energy-momentum correlation from (21.30), the non-trivial 
canonical equations are expressed as 


dqt _ Pi 
ds mo ’ 


dt 


p 2 c 2 + mlc 4 


ds moc 2 moc 2 


(21.31) 


We may finally rewrite the canonical equation for the qt in terms of the time t as the 
independent variable 


dqi dqi ds pi moc 2 piC 2 

dt ds dt mo e e 


PiC* 


P 2 c 2 + m 2 0 c 4 


dHrip) 

dpt 


(21.32) 


But this is nothing else than the non-trivial canonical equation of the conventional 
Hamiltonian 


H t (p ) = e - yj p 2 c 2 + WqC 4 . 


(21.33) 


We thus encounter the well-known conventional Hamiltonian H r (p ) of the free rela¬ 
tivistic particle. In contrast to the extended Hamiltonian (21.29), the physically equiv¬ 
alent conventional Hamiltonian (21.33) does not manifest anymore its Lorentz invari¬ 
ance. 

To complete this example, we show that the extended Hamiltonian (21.29) also 
emerges as the Legendre-transformed Lagrangian (21.11) from Example 21.1. The 
extended Legendre transformation that relates extended Lagrangians with extended 
Hamiltonians was defined in equations (21.18), (21.19), and (21.21) of Sect. 21.2. For 
the addressed case, the canonical momenta evaluate to 


Pi 


3Li 

h¥) 


--mo 


dqt 

ds 




3Li 

hW) 


2 dt 

■moc 

ds 


The extended Hamiltonian is then obtained by expressing the derivatives dqi /ds and 
de/ds that are contained in the Lagrangian and in the Legendre transformation rule in 
terms of the momenta pi and e , 


H l (p,e) = T — -- L -2-Li(P’e)’ L x (p,e) = Y^ - 

‘ ^ W! r\ yt'l r\ ^ / W1 r\ 


mo moc 


EL 

2m o 


Imoc 2 


1 2 

-moc 


The extended Hamiltonian is then 


Hi(p,e) = -^— 
2m o 


2 "I 


L i 

which coincides with (21.29). 


+ -mo c 


Example 21A 


EXAMPLE 


21.5 Hamiltonian of a Relativistic Particle in a Potential V(q,t) 

The non-relativistic dynamics of a particle in a potential V(q,t) is described by the 
Hamiltonian 
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Analogously to the dynamics of a free relativistic particle, treated in Example 21.4, 
the relativistic dynamics of a particle in an external potential V(q,t ) is described by 
the extended Hamiltonian 


Hy v (q,p,t,e) 



e-V(q,t) 


+ -mo c 


(21.35) 


The constant term was chosen to ensure that for p = 0 and V (q , t) = 0 the constraint 
H\ jT = 0 leads to e = moc 2 . Consequently, for the general case H\ r = 0 induces the 
scleronomous constraint 

(e — V(q, t)) 2 = p 2 c 2 + mQC 4 . (21.36) 


Again, q, p,t and e represent independent canonical variables, but only those combi¬ 
nations of q, p, t and e have a physical meaning which satisfy (21.36). 

The extended set of canonical equations (21.23) emerging from the extended 
Hamiltonian (21.35) is 

8H\j dpi e — V(q,t)dV dH\j dqi pt 

dqi ds moc 2 dqi ’ dpi ds mo ’ 

dHi tT _ de _ e - V(q, t ) dV dH u _dt _e- V(q, t) 

3 1 ds moc 2 3 1 ’ de ds moc 2 

We may express the canonical equations equivalently using the time t as the indepen¬ 
dent variable, and eliminate the canonical variable e by means of the constraint (21.36) 


dpi dpi ds e — V(q,t)dV moc 2 dV dH x 

dt ds dt moc 2 dqie — V(q,t ) dqi dqi 

dqi _ dqi ds _ p t moc 2 _ p t c 2 _ p t c 2 _ 3 H v 
dt ds dt mo e — V(q,t) e-V(q,t ) p ^ c 2 + m 2 c 4 dpi ’ 

de de ds e — V(q,t)dV moc 2 dV 3 H v 
dt ds dt moc 2 dt e — V(q,t ) dt dt 

These equations can be conceived to represent the canonical equations emerging from 
the conventional Hamiltonian 

H x (q, p,t) = e = y p 2 c 2 + mlc 4 + V(q, t). (21.37) 

We thus encounter the Lorentz invariant form of the conventional Hamiltonian for a 
particle in an external potential V(q,t). The Hamiltonians H\ v from (21.35) and H r 
from (21.37) are physically equivalent , hence describe the same dynamics. On the 
other hand, the extended Hamiltonian H\ r additionally determines the parameteriza¬ 
tion of time t = t(s). 

We finally note that the extended Hamiltonian (21.35) can be derived according 
to (21.18), (21.19), and (21.21) as the Legendre transformed function of the extended 
Lagrangian 
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. dqj dt 


1 

= -mo C 


2 1 

C 2 ds 
i 


dt y 

ds / 


dt 

V(q, t) —. 
ds 


(21.38) 


From (21.38), the correlations of the “velocities” dqt /ds and dt /ds with the canonical 
momenta, pi and —e, evaluate to 


Pi = 


9L i 


dg; 

_W ° ds 


3 ® 


e = — 


dL i 


9 d£ 

= m 0 c z —- + V(q,t). 
ds 


(21.39) 
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EXAMPLE 


21.6 Relativistic “Harmonic Oscillator” 


In this example, we discuss the relativistic motion of a particle in a quadratic external 
potential. According to (21.35), the extended Hamiltonian of this system is given by 


Hi, r (q,p,e) 




+ -me 


(21.40) 


The associated constraint p, e) = 0 yields a relation of the formally indepen¬ 

dent canonical variables p, q, and e 

p 2 c 2 — — ^kq 2 ^j +m^c A = 0. (21.41) 

Solving this relation for e, we obtain the corresponding conventional Hamiltonian H v 
as the right-hand side of the equation e = H v , 






2^2 -\- m 2 c 4 _j_ - fcq 2 


(21.42) 


The extended set of canonical equations following from the extended Hamil¬ 
tonian (21.40) are 


9 H\ v _ dqj _ pi 
dpt ds mo ’ 

dH\ r de 

-— = — = 0 , 

dt ds 


dH\ r dpt e - Ikq 2 

- = ~T = - 2~ kqi 

aqi ds moc 

dH\j dt e—\kq 2 

de ds moc 2 


We may again express these equations equivalently by eliminating e according 
to (21.41) and by replacing s with the laboratory time t as the system’s independent 
variable 


dqi dqi ds pic 2 dH r 

dt ~ ds dt~ y^ + ^4 ~~ 3 Pi ’ 

dpt dpt ds ^ d H r 

dt ds dt dqt 


(21.43) 
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As expected, we encounter the canonical equations of the conventional Hamil¬ 
tonian (21.42). The pair of first-order equations can be merged into a single second- 
order equation for qi (t), 


k 

q(t) -\ - 

mo 


1 - 


q 2 {t) 


q(t) = 0. 


(21.44) 


For q(t) —> c we thus have q(t) —> 0. In agreement with the postulates of special 
relativity, the speed of light, c, constitutes the absolute limit for the particle’s velocity, 
q(t). The term in brackets forms a power of the relativistic correction factor, y 


y- 2 = l- 



(21.45) 


The equation of motion (21.44) may thus be rewritten concisely as 

q(t) + -^q(t) = 0. (21.46) 

moy 

In this form, the equation of motion appears to agree with its non-relativistic coun¬ 
terpart, except for the occurrence of the relativistic correction factor y 3 in front of 
the rest mass term m o. For q « c, hence for y 1, indeed the equation of motion 
of the ordinary harmonic oscillator comes out. Yet, the explicit form (21.44) of the 
equation of motion shows that we are dealing with a non-linear system, with solutions 
that no longer consists of harmonic oscillations. Thus, strictly speaking, a relativistic 
oscillator can never be a harmonic oscillator. 

In accelerator physics, the quantity moy 3 is referred to as the “longitudinal mass” 
of an ion beam particle. In the laboratory system, the longitudinal oscillation fre¬ 
quency of the relativistic motion of a particle in an accelerator or high energy storage 
ring appears as if the particle’s mass be increased by a factor of y 3 . 


EXAMPLE 


21.7 Extended Hamiltonian for a Relativistic Particle in an External Electromag¬ 
netic Field 


The Hamiltonian counterpart H\ of the extended Lagrangian (21.14) from Exam¬ 
ple 21.2 for a relativistic point particle in an external electromagnetic field is obtained 
via the Legendre transformation prescription from (21.18). According to (21.19) 
and (21.21), the canonical momenta pi and po are introduced by 


3Li dqt f 

Pi = , da .. =m — + —Aj(q, t), 

3 ® ds c 

dL\ dqo £ 

po = 77^ = m o-i - <Kq,t). 

9® ds c 


(21.47) 


We notice that the kinetic momentum pi^ = m dqt /ds differs from the canonical mo¬ 
mentum pi in the case of a non-vanishing external potential Af / 0. The condition for 
the Legendre transform of L \ to exist is that its 4 x 4 Hessian matrix with elements 
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d 2 L\/\d(dq^/ds)d(dq v /ds)] must be non-singular, hence that the determinant of this 
matrix does not vanish. For the extended Lagrangian (21.14) from Example 21.2, this 
is actually the case as 


det 


9 2 L i 


3 (^) 3 ® 


I/O. 


This falsifies claims occasionally found in literature that the Hesse matrix associated 
with an extended Lagrangian L\ be generally singular , and that for this reason an ex¬ 
tended Hamiltonian H\ generally could not be obtained by a Legendre transformation 
of an extended Lagrangian L \. 

With the Hessian condition being actually satisfied, the extended Hamiltonian H\ 
that follows as the Legendre transform (21.18) of L\ evaluates to 


H\ (q, p, t, e) = 


2m o 


p--A(q,t ) ) - 


e-^iqj) 


+ -moc 2 . (21.48) 


The constraint H\ = 0 then furnishes the usual relativistic energy-momentum relation 

(e - Z(/)(q,t)) 2 = c 2 (p - ^A(q,t)^j +m\c A . (21.49) 

The conventional Hamiltonian H(q, p,t) that describes the same dynamics is deter¬ 
mined according to (21.20) as the particular function, whose value coincides with e. 
Solving H i = 0 from (21.48) for e , we directly find H as the left-hand side of the 
equation H = c, 


Example 21.7 


H = 





2 

+ «qC 4 + t;4>(q, t ) = e. 


(21.50) 


The Hamiltonian H m (q,p,t ) that describes the particle dynamics in the non- 
relativistic limit is obtained from the Lorentz-invariant Hamiltonian (21.50) by ex¬ 
panding the square root 

H m = 2—(p -~A(q,t)\ +Z<t>(q,t) + m 0 c 2 . (21.51) 

2m o \ c ) 

In contrast to the extended Lagrangian description, a direct way to transpose the rel¬ 
ativistic extended Hamiltonian from (21.48) into the non-relativistic Hamiltonian H m 
does not exist. We conclude that the Lagrangian approach is more appropriate if we 
want to “translate” a given non-relativistic Hamilton-Lagrange system into the corre¬ 
sponding Lorentz-invariant description. 

In order to show that the extended Hamiltonian (21.48) and the well-known con¬ 
ventional Hamiltonian (21.50) indeed yield the same dynamics, we now set up the 
extended set of canonical equations (21.23) for the extended Hamiltonian (21.48) 


dpt 

ds 



moc 2 


(e ~ f 0) 


30 

dqt ’ 


de 

ds 


mqc 




dAk 
dt + 


moc 2 


(e ~ ( 0 ) 


30 

~dt 9 
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dqt 

ds 

dt 

ds 


— [PI--A,], 

mo ~ 

1 


moc A 


r (*?-f0). 


(21.52) 


From the last equation, we deduce the derivative of the inverse function s = s(t) and 
insert the constraint from (21.49) 


ds moc 2 moc 2 

* e ^ 


(21.53) 


The canonical equations (21.53) can now be expressed equivalently with the time t as 
the independent variable 


dpi 

dt 


de 

dt 


dqt 

dt 


dpi ds 
ds dt 


JcHp 

de ds 
ds dt 


(c _ 

^A(q,t)) 2 + m^c 4 




c 


dAk 90 
dqi dqt ’ 


_£c_ 

/ c 2 (p - ^A(q, r)) 2 + m^c 4 

ds c 2 



ds 



^A(q,t)) 2 +mlc 4 



dAk_ 90 

9? dt’ 



(21.54) 


The right-hand sides of (21.54) are exactly the partial derivatives dH/dqt, dH/dt, and 
dH/dpi of the Hamiltonian (21.50) —and hence its canonical equations, which was to 
be shown. 

The physical meaning of the dt/ds is worked out by casting it to the equivalent 
form 


£= , + ( ' , --ff , » 2 =J 1 + (^) 2 = rW . 

ds ^ m^c z V V m Q c / 

with /? k (s) the instantaneous kinetic momentum of the particle. The dimensionless 
quantity dt/ds thus represents the instantaneous value of the relativistic scale fac¬ 
tor y. 


21.3 Extended Canonical Transformations 

The conventional theory of canonical transformations is built upon the conventional 
action integral from (21.1). In this theory, the Newtonian absolute time t plays the role 
of the common independent variable of both original and destination system. Similarly 
to the conventional theory, we may build the extended theory of canonical equations 
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on the basis of the extended action integral from (21.2). With the time t = qo/c and the 
configuration space variables qt treated on equal footing, we are enabled to correlate 
two Hamiltonian systems, H and H ', with different time scales, t (s) and T ( s ), hence 
to canonically map the system’s time t and its conjugate quantity e in addition to the 
mapping of generalized coordinates q and momenta p. The system evolution parame¬ 
ter s is then the common independent variable of both systems, H and H'. A general 
mapping of all dependent variables may be formally expressed as 

2/x = 2/z(#v> Pv)i Pji = ^/x(^v? Pv)i P = 0, • • •, n. (21.55) 

Completely parallel to the conventional theory, the subgroup of transformations (21.55) 
that preserve the action principle 8S\ = 0 of the system is referred to as “canonical.” 
The action integral (21.2) may be expressed equivalently in terms of an extended 
Hamiltonian by means of the Legendre transformation (21.18). We thus get the fol¬ 
lowing condition for a transformation (21.55) to be canonical 


ni>£-«■(*•*> 

JSa L/i=o 


ds 



-H[(Q v ,P v ) 


ds. 


(21.56) 


As we are operating with functionals, the condition (21.56) holds if the integrands dif¬ 
fer at most by the derivative dT\jds of an arbitrary differentiable function T\ (q v , Q v ) 


V „ dq ^ u _ P d Q 

/x=0 ji =0 


ds 


i 1 frt, dT \ 


(21.57) 


We restrict ourselves to functions T\{q v , Q v ) of the old and the new extended config¬ 
uration space variables, hence to a function of those variables, whose derivatives are 
contained in (21.57). Calculating the s-derivative of T\, 


dJ~\ _ dT\ dq^ ^ dT\ dQ^ 
ds “L ds 92/x ds J’ 


(21.58) 


we then get unique transformation rules by comparing the coefficients of (21.58) with 
those of (21.57) 


Pu - 


dFi 

dq^ 


8T\ 

Wu 




(21.59) 


T\ is referred to as the extended generating function of the—now generalized— 
canonical transformation. The extended Hamiltonian has the important property to be 
conserved under these transformations. Corresponding to the extended set of canon¬ 
ical equations, the additional transformation rule is given for the index p = 0. This 
transformation rule may be expressed equivalently in terms of t(s), e(s), and T(s ), 
E(s) as 
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with E , correspondingly to (21.20), the value of the transformed Hamiltonian H r 

Po(s) = -EE, e(s) = H'(Q(s), P(s), T(s)). (21.61) 

The transformed Hamiltonian H' is finally obtained from the general correlation of 
conventional and extended Hamiltonians from (21.25), and the transformation rule 
H[ = H\ for the extended Hamiltonian from (21.59) 

r , r n dt 

7-, - £]_ = [«(„. p.o _ e] _. 

Eliminating the evolution parameter s, we arrive at the following two equivalent trans¬ 
formation rules for the conventional Hamiltonians under extended canonical transfor¬ 
mations 


[H'(Q, P,T)-E]— = H(q, p, t ) - e, 
[H(q, p,t)- e ]^ = H\Q, P, T) - E. 


(21.62) 


The transformation rules (21.62) are generalizations of the rule for conventional 
canonical transformations as now cases with T / t are included. We will see at the 
end of this section that the rules (21.62) merge for the particular case T — t into the 
corresponding rules (19.10), (19.12) of the conventional canonical transformation the¬ 
ory. 

By means of the Legendre transformation 


J~2(civi Pv) — Qv) + 'y ' G/x^/x? 

/x=0 




dTi 

92/ 


(21.63) 


we may express the extended generating function of a generalized canonical trans¬ 
formation equivalently as a function of the original extended configuration space vari¬ 
ables q y and the extended set of transformed canonical momenta P v . As, by definition, 
the functions T\ and T 2 agree in their dependence on the so do the corresponding 
transformation rules 

dJ-\ dJ~2 
9^ = 3^ = /V 

This means that all q^ do not take part in the transformation defined by (21.63). Hence, 
for the Legendre transformation, we may regard the functional dependence of the gen¬ 
erating functions to be reduced to T\ =T\ (Q v ) and T 2 = ^2(Pv)- The new transfor¬ 
mation rule pertaining to T 2 thus follows from the P v -dependence of T 2 


df 2 

dPv 


E 

/x=0 


dj~\ dQ u dQu 9/E 

-h PII -h Qn - 

V^Q^dP, 3P V ^ dP v 


E 

/jL= 0 1 ” 


dQu dQ u 

-p^ + p^ + Q^v 


3 P v 


= Qv 
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The new set of transformation rules, which is, of course, equivalent to the previous set 
from (21.59), is thus 


dj~ 2 dJ~ 2 / 

/v = ^’ ^ = 77^, H[ = H!. 

oru 




(21.64) 


Expressed in terms of the variables q, p,t,e, and Q, P, T, E the new set of coordinate 
transformation rules takes on the more elaborate form 


Pi = 


a ^ 2 
Hi ’ 


Qi = 


d^ 2 

9/T 


e = — - 


dT 2 

~dT 9 


T = - 


97-2 

~3E' 


(21.65) 


Similarly to the conventional theory of canonical transformations, there are two more 
possibilities to define a generating function of an extended canonical transformation. 
By means of the Legendre transformation 


ftiPv'Qv) — (ptviQv) ^ ] tf/lP/li 

/i=0 


Pjl — 


dfi 

d<ln ’ 


we find in the same manner as above the transformation rules 
dJ -3 aj^ , 

q [ i = ~ 1 r ± , P n = -T7r’ H l = H l- ( 21 . 66 ) 

Hii 9Q/x 

Finally, applying the Legendre transformation, defined by 


xiPvi Pv) — ^3 (.Pvi Qv ) H” ^ ] 6/Z^/Z> PjJL — 


/i=0 


9^3 


the following equivalent version of transformation rules emerges 


Vn = 


dJ ~4 
Hu' 


dj ~4 , 

G/x = — h[ = h x . 

dP/ 


(21.67) 


Calculating the second derivatives of the generating functions, we conclude that the 
following correlations for the derivatives of the general mapping from (21.55) must 
hold for the entire set of extended phase-space variables, 


96/x _ Hv 96/x _ 

dq v dP/ dp v 

dP/i _ dPv 9P m 

dq v dQ/i dpv 

Exactly if these conditions are fulfilled for all /x,v = 0,...,w, then the extended co¬ 
ordinate transformation (21.55) is canonical and preserves the form of the extended 
set of canonical equations (21.23). Otherwise, we are dealing with a general, non- 
canonical coordinate transformation that does not preserve the form of the canonical 
equations. 


9<?v 
"9P/ 
dq v 

96^ 


( 21 . 68 ) 









432 


21 Extended Hamilton-Lagrange Formalism 


EXAMPLE 


21.8 Identical Canonical Transformation 

As the first example of an extended generating function T 2 (qj, Pj,t, E), we consider 
the generating function of the identical transformation 

n n 

Pj,t, g)= p t i=q, p, - tE. 

li=0 i=1 

The particular transformation rules for this case follow from their general form, given 
by (21.65), 

Pi —Pi, Qi=qi, e = E, T = t, H[ = Hi . 

According to (21.62), the transformation rule for the extended Hamiltonians, 
H[ = H\, yields now for the conventional Hamiltonians, H' and H, 

which means, after inserting the coordinate transformation rules, 

H'(Q j ,P j ,T) = H(q j ,p j ,t). 

The existence of a neutral element is a precondition for the set of extended canonical 
transformations of a Hamiltonian system H (pj , qj , t) to form a group. 


EXAMPLE 


21.9 Identical Time Transformation, Conventional Canonical Transformations 

The connection of the extended canonical transformation theory with the conventional 
one is furnished by the particular extended generating function 


Fifaj, PjH, E) = F 2 (qj, Pj,t) - tE , 


(21.69) 


with F 2 (qj, Pj, t) denoting a conventional generating function. According to (21.65), 
the coordinate transformation rules following from (21.69) are 


Pi = 


dF 2 
Hi ’ 


dF 2 


dF 2 

e = ~i^ + E ’ 


T — t. 


Together with the general transformation rule (21.62) for conventional Hamiltoni¬ 
ans, we find the rule for Hamiltonians under conventional canonical transformations 
from (19.12), 

H’{Qj, Pj,t ) = H(qj, Pj ,t) + E-e = H( qj , Pj ,t ) + 


Canonical transformations that are defined by extended generating functions of the 
form of (21.69) leave the time variable unchanged and thus define the subgroup of 
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conventional canonical transformations within the general group of extended canoni¬ 
cal transformations. In the present example, the time t also forms a common indepen¬ 
dent variable of both the original and the transformed system —just as presupposed in 
case of a conventional canonical transformation. Corresponding to the trivial extended 
Hamiltonian from (21.26), we may refer to (21.69) as the trivial extended generating 
function. 

EXAMPLE 

21.10 Extended Point Transformations 

We consider the extended canonical transformation defined by an extended generating 
function that is linear in the P v , 

n n 

Fi(q v , p v ) = X ! p <* f<Mv) = X! PifMj't) - E Mqj,t)/c, 

<2=0 i =1 

with the fciipv) denoting arbitrary differentiable functions. The transformation 
rules (21.64) for this T 2 follow as 

Gm = /m(9v). = H[(Q v ,P v ) = H x {q v ,p v ). 

«=0 

The new configuration space coordinates Qi , i = 1,..., n and the new time T = Qo/c 
thus emerge as functions of the original configuration space coordinates qt and time 
t = qo/c, without any dependence on the canonical momenta and energy. Similar to 
the case of a conventional canonical transformation (see Example 19.2), mappings of 
this type are referred to as point transformations. 

For the particular case /o = fo(t), we get the rule T = fo(t)/c. The transformed 
time T then only depends on the original time, t, and not on the configuration space 
variables, qi. For multi-particle systems, T then retains in the transformed system the 
property of the original system’s time t to be common to all particles. 

EXAMPLE 


21.11 Time-Energy Transformations 

The general form of an extended generating function that defines an extended 
canonical transformation that leaves position and momentum coordinates invariant , 
but transforms solely time t and energy e, is given by 

n 

ftiPj, Qj ,e,T) = -J2 Pi Qt + f(e, T). 
i =1 

Herein, f(e,T ) denotes an arbitrary differentiable function of the energy e of the 
original system, and of the time T of the transformed system. According to (21.66), 
the particular transformation rules are 

df df 

Pi=Pi , qi = Qi, E = t = H[ = Hi. (21.70) 
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The transformed conventional Hamiltonian H f follows from (21.62) as 



The special case f(e,T) = eT again yields the identical transformation. We observe 
that the transformed Hamiltonian H' emerges from the original Hamiltonian H by 
multiplication with a the factor d 2 f/dedT. This contrasts to the case of conventional 
canonical transformations from Chap. 19, for which H' emerges from H by addition 
of the partial time derivative of a conventional generating function Ti,2 ,3,4- 

EXAMPLE 

21.12 Liouville’s Theorem in the Extended Hamilton Description 

For extended canonical transformations, the pertaining generalized form of Liou- 
ville’s theorem applies analogously to the conventional Liouville theorem from Ex¬ 
ample 19.6, 

gen. can. transf. 

dQo...dQ n dPo...dP n = dqo .. .dq n dpo .. .dp n . 

This may be written equivalently as 


gen. can. transf. 


dqi ... dq n dpi ... dp n dt de. 


dQ \.. .dQ n dP \.. .dP n dT dE 


The generalized form of Liouville’s theorem thus states that the extended volume form 
dV\ = dqo ... dq n dpo ... dp n dt de is conserved under extended canonical transfor¬ 
mations , hence that the determinant D that is associated with the Jacobi matrix of the 
transformation is always unity. As the amount of canonical variables remains an even 
number in the extended description, we may again represent D by 


d(6o, •••> Qn) B(po,...,p n ) 
d(qo,...,q n ) _d(Po,...,P n ) 


If the transformation rules can be derived from a generating function of type 
Ti((fri, P/jl), then we are dealing with the particular case of a canonical coordinate 
transformation. Inserting the equations for Q v and p v yields 



This equation holds as (i) the partial derivatives may be interchanged, and (ii) due 
to the fact that transpose matrices have the same determinant. We will see in Exam¬ 
ple 21.18 that under generalized canonical transformations—hence transformations 
that also map the time scales of original and destination systems—only the gener¬ 
alized version of Liouville’s theorem applies, and not the conventional form from 
Example 19.6. 

EXAMPLE 

21.13 Extended Poisson Brackets 


In Example 19.7, we proved the invariance of conventional Poisson brackets [F, G] 
under conventional canonical transformations. We will show that under generalized 
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canonical transformations, the invariance property holds for extended Poisson brack¬ 
ets , [F, G] e . In analogy conventional brackets, the extended Poisson brackets are de¬ 
fined by 


[F, G] e = 



dF 3G 
Hti 3 Pn 


dF 3 G 
3 P/i 3 


= [F, G] 


3 F 3 G dF 3 G 
3 1 de de dt 


(21.71) 


Herein, F = F(q, L . p, L ) = F(q,, p : .t, e ) and G = G(qp, L ) = G(qi,pi,t, e ) denote 
arbitrary differentiable functions. Due to the complete analogy of conventional and 
extended canonical transformation formalisms, the proof of the invariance of extended 
Poisson bracket under extended canonical transformations formally coincides with 
that of Example 19.7. We thus find that under extended canonical transformations 


[F, G] 


e 

q,p,t,e 


= [ F , G] 


& Q,P,T,E' 


(21.72) 


We will show in Example 21.18 that the Lorentz transformation can be conceived 
as a particular extended canonical transformation. Consequently, extended Poisson 
brackets are always Lorentz invariant. 

The total s derivative of a function / = f(qi , p\ , t) is 


df_ _ y^/ 3/ dqi_ + yf_dpi \ 

ds ds dpi ds ) 3 1 ds 

= yV Elt i d JL dPfl ) 

^\ 3 q^ ds dpn ds ) 

= ± (*L d -*-*L d JF) = [f , Hll 

Pn dp^dqj 


(21.73) 


In the context of the extended Hamilton formalism, the extended Poisson bracket of an 
explicitly time-dependent function f(qi , pi , t) with the extended Hamiltonian H\ thus 
yields directly the total derivative of / with respect to the independent variable, s. This 
agrees formally with the conventional Poisson bracket of a conventional Hamiltonian 
H with a function fiyp , p{) that does not explicitly depend on time t. In that case, we 
obtain the total time derivative of /. 


EXAMPLE 


21.14 Canonical Quantization in the Extended Hamilton Formalism 

By canonical quantization , we denote a formalism to derive the quantum mechanical 
equations of motion for a complex wave function x/siq^) on the basis of a correspond¬ 
ing classical system that is described by an extended Hamiltonian H\ . Explicitly, this 
means to replace the classical momenta p v with momentum operators p v , along with 
the replacement of the classical extended Poisson brackets [, ] e by quantum mechan¬ 
ical commutators { , }_. The quantum mechanical commutator { , }_ of two operators 
A and B is defined as 

{A,B} =AB - BA. 
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(21.74) 
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Analogously to the fundamental Poisson bracket (19.29) from Example 19.7, we then 
find for the extended set of fundamental commutators 


{q/jL, <7v}_ = 0, {pfi, Pv}_ = 0, [q^, p v }_ = ih8^ v , /x,v = 0,...,n. 

(21.75) 


Equations (21.75) are obviously satisfied if 


<7/x — #/x> 


Pv = 



(21.76) 


For, if we let the commutator { q M , p v }_ of the operators q^ and p v act on an explicitly 
time-dependent function \/f(qx) = ^(qi, • • •, q n , t), we get 




= ifiSn V xlr(qx). 


(21.77) 


Because of qo = ct , the momentum operator for the index v = 0, i.e. po = —e/c, has 
the alternative representation 

3 

e = ih—. (21.78) 

dt 

Parallel to the momentum operators pi = —ihd/dqi that are conjugate to the config¬ 
uration space variables qi , one thus finds in the extended description the operator e 
for the system’s instantaneous energy content as the conjugate quantity of the time 
variable t. 

Furthermore, in the extended Hamiltonian formalism of canonical quantization, 
the extended Hamiltonian H\ = 0 from (21.25) is replaced by the extended Hamilton 
operator H\ = 0 

- dt ~ dt ( - 3 \ 

H ' = 7-s <H - e) = A H - ih V,) =0 ' (21 ' 79) 


with H denoting the related conventional Hamilton operator of the given quantum 
mechanical problem. As long as the operator equation H\ = 0 is not submitted to 
an extended canonical transformation, we are allowed to identify the time t with the 
system’s evolution parameter s, (t = s ). We thereby find an operator equation that is 
no longer Lorentz invariant 

9 

H-ih—= 6. (21.80) 

3 1 

If we let these operators act on an explicitly time-dependent function \j/(qi ,t), we get 
the following partial differential equation 


H\[r(qi, t) = ik 


d'lfjqiit) 
3 1 


(21.81) 
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In the realm of quantum mechanics, this equation is referred to as the Schrodinger 
equation. 


EXAMPLE 


21.15 Regularization of the Kepler System 

As a first example of an extended canonical transformation that includes a mapping of 
the time scale of the given dynamical system, we discuss the generalized formulation 
of L. Euler’s regularization of a Kepler system. This technique is commonly referred to 
as the “Kustaanheimo-Stiefel (KS)” transformation, and sometimes also as the “Hopf” 
transformation. It has the properties (i) to ensure the regularization of the equations 
of motion, (ii) to permit a uniform treatment all three types of Keplerian motion, and 
(iii) to transform the equations of the two-body problem into a harmonic oscillator 
form. We formulate this transformation here as a generalized canonical transforma¬ 
tion , where in addition to a transformation of spatial and momentum coordinates the 
physical time t of the original Kepler system is mapped into a new time T that para¬ 
meterizes the motion within the transformed system. We write the Hamiltonian of the 
Kepler system in normalized form 

H(qj,Pj) = ^(Pi+pj + pf) ~ A K = G(mi+m 2 ), r 2 =q\ + q\ + q\. 

(21.82) 

Herein, G denotes the gravitational constant, mi, m 2 the masses of the interacting 
bodies, and r their distance in the 3-dimensional configuration space. As H does not 
depend on time explicitly, we have dH/St = de/dt = 0, and hence 

1 K 

e =-(/>i+ p\ + Pj) ~ ~ = const. (21.83) 

Obviously, in this description the system has a singularity for r —>► 0. We will now 
show that the Kepler system (21.82) can be canonically transformed into another 
Hamiltonian system that does not exhibit any singularities. This canonical transfor¬ 
mation can be defined in terms of a generating function of type , 

?3 (Qj, Pj, T, e) = - l -pi{Q\ -Q\-Q\+ Ql) -P2 (QiQ2~ Q 3 Q 4 ) 

-P3(QiQ3 + Q2Q4) + e f Z(T)dT. (21.84) 

Jo 

As the generating function is linear in the pi and in e , the KS-transformation consti¬ 
tutes an extended canonical point transformation. It depends on an as yet undetermined 
time function i=(T) and has the particular feature that the transformed system has four 
degrees of freedom in place of three of the original system. We will see that this 
particular correlation of both systems gives rise to some freedom in fixing the initial 
conditions of the transformed system. According to the transformation rules (21.66), 
the old spatial coordinates qt are expressed in terms of the new ones, Qi, as 
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q\ = - 


9JT 3 

dpi 


\{Q\-Ql-Ql + Ql), 


<72 = ■ 


<73 = 


dp2 

9^3 

9t?3 


= Q 1 Q 2 ~ Q 3 Q 4 , 
= Q 1 Q 3 + QlQ4- 


We directly verify that 


(21.85) 


r — + ,I + ,| — — ( 2 i + Q 2 + 63 + 64 )- ( 21 . 86 ) 

The momenta Pi of the transformed system follow from the generating func¬ 
tion (21.84) as 


Pi=~ 


P 2 = 


P3 = ~ 


P 4 — 


dj -3 

Hh 

dJ~3 

Hh 

dT 3 

Hh 

dJ~3 


924 
which yields 


= Pl2l +T222 + T323, 
= ~P\Ql + P2 2l + P3 24, 
= -Pi23 - P 2 Q 4 + P3Q\, 
= PIQ 4 - P 2 Q 3 + P 3 Q 2 , 


2 2 2 P l + P 2 + P 3 + P 4 

Pi + Pl + P3 = q2 + q2 + q2 + q2 - 


The transformations of energy e and time t are 


dJ -3 

~^= e H(T), 

01 


9^3 

de 


l 


H*)d t. 


(21.87) 


( 21 . 88 ) 


(21.89) 


From the transformation rule for the conventional Hamiltonians from (21.62), H' is 
finally obtained as 


H'(Q j ,P j9 T) = H(q j9 p j ,t)S(T), 


(21.90) 


which is, as expected, in agreement with the transformation rule of their values, E 
and e. In explicit form, the transformed Hamiltonian Pj, T) is then found by 

expressing the original Hamiltonian in terms of the new variables 


H\Qj,Pj,T) = 


HT) 


-(Pf+Pi : + Pi + P^)-2K 


. (21.91) 


Ql + Ql+Qj+Qjll 

The constant energy e of the original system writes in terms of the new coordinates 

\{P\ + Pl + Pl + Pl) - 2 K 


q\ + q 2 2 + q] + qI 


- const. 


(21.92) 
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From H ', the canonical equations of the transformed system evaluate to 


dH f 

Wt 

dH' 

Wi 


HT) 


dQj 
dT 

dP' L 

dT ~ ( Q\ + Q\ + Ql + Q \) 2 


Qi + Qi + Qi + Qi 

2 HT) 


Pi, 


- (Pf + P\ + P] + Pi) - 2 K 


Qi- 


We may merge the pairs of first-order equations into second-order equations for the 
Qi,i = 


d 2 Qi 


dT 2 ${T)dT 


— 2e 


HT)~ 


HT) 


Ql + Qj + Qj + QjdT 


{qI + qI + qI + qD 


dQi 

~dT 


-i2 


LQi + Q 2 2 + Q 2 3 + Q 2 4 J 


Qi = 0 . 


(21.93) 


After having worked out the equations of motion of the transformed system, we are 
now in the state to fix the as yet undetermined time function i;(T). With the trans¬ 
formed canonical position coordinates conceived as functions of the transformed time, 
Q t = Qi (r), we may define 


S(T) = Q\(T) + Qj(T) + Ql(T) + Q 2 4 (T). 


(21.94) 


By virtue of the fixation of the relation of the physical time t of the Ke¬ 

pler system to the time T of the transformed system is uniquely determined 
through (21.89) 


t(T) = f T [Qj(T) + Qj(T) + Q\{ t) + Q 2 4 (T)\dr. (21.95) 

Jo 

Note that the identification of %(T) with the time evolution of a function the canonical 
variables does not mean that %(T) acquires an explicit dependence on the canonical 
variables. With this particular scaling of the transformed time T , the equations of 
motion (21.93) simplify to 


d 2 Q, 
dT 2 


2 eQi = 0 . 


(21.96) 


For e < 0, the orbit is closed in the original Kepler system. In the transformed system, 
we then get four uncoupled equations of motion of the time-independent harmonic 
oscillator, which we already know to be analytically solvable. 

Equations (21.96) can be regarded as the equations of motion that emerge from the 
canonical equations of the Hamiltonian 


Pj ) = l -(Pt + P 2 2 + Pi + Pi) - e(Qj + Q\ + Ql + Q 4 ). (21.97) 


By means of the relation (21.92), we immediately find the constant value E" of the 
Hamiltonian (21.97) 
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E" — IK — const. 


(21.98) 
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The original Kepler system may now be solved according to the scheme sketched 
at the end of Example 19.3. We must first transform the given initial conditions 
#i(0), #2(0), #3(0) and p i(0), /?2(0), P3(0) of the Kepler system into the initial con¬ 
ditions Gi(0), G 2 (0), G 3 (0), G 4 (0) and i>i(0), P 2 (0), P 3 (0), P 4 (0) for (21.96). This 
can be worked out by inverting the transformation rules (21.85). For a unique inverse 
to exist, we must choose one constraint, which may be defined for convenience as 

O 4 (0)= f 0. (21.99) 

With this setting, the initial values of the configuration space variables of the trans¬ 
formed system are 

2l(0) = y/ <?i (0) + r(0), Qm= 3A, 23(0) = ^r. (21.100) 

yi(0) q i(0) 

The initial momenta P t (0) are then directly obtained from the general transformation 
rules (21.87). Now, the harmonic oscillator equations (21.96) may be solved analyt¬ 
ically to find the solutions Qi(T), Pi (T) at time T. The configuration space coordi¬ 
nates qi(T) at time T of the original Kepler system are then found from the trans¬ 
formation rules (21.85). The corresponding momentum coordinates pt(T) at time T 
of the original Kepler system follow by solving the transformation rules (21.87) for 
the pi 


Pi = —{Q\P\ ~ QiPi ~ Q3P3 + G 4 ^ 4 ) 

2 r 

P2 = —( 02^1 + Q1P2 - Q4P3 - Q3P4) 

2 r 

P 3 = —(63^1 + Q4P2 + Q1P3 + Q2P4) 

2 r 

0 = — (—G 4 ^i + Q3P2 - Q2P3 + Gi^ 4 ). 

2 r 

The remaining task is to invert the analytic solution of (21.95) to find the represen¬ 
tation T(t). We can then finally express the solutions qi(T), Pi(T) in terms of the 
Kepler’s system time t to obtain the # ; - (t) and the pi (t). 


EXAMPLE 


21.16 Time-Dependent Damped Harmonic Oscillator 

As another example for an extended canonical transformation we will show that the 
time-dependent harmonic oscillator with also time-dependent damping coefficient can 
directly be mapped into a conventional (time-independent) undamped harmonic oscil¬ 
lator. Written for n degrees of freedom, the Hamiltonian of the original system is given 
by 


H(q r p h t) = -e 


-Fit) 




i =1 


i =1 


( 21 . 101 ) 
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with co 1 (t) and F(t ) denoting arbitrary , not necessarily periodic, differentiable func¬ 
tions of time. The subsequent equations of motion follow as 


Pi=e F{t) qi, qi + f(t)qi+co 2 (t)qi = 0, f(t) = F(t), i = 

( 21 . 102 ) 


As the “target system” H f —with T the independent variable—we demand the ordi¬ 
nary time-independent and undamped harmonic oscillator, 


H'(Qj,Pj)= 1 -f2 P i+l Q2 f2Qi- 


i =1 


i =1 


(21.103) 


In the context of the generalized canonical transformation theory, this means that the 
transformed time T represents a cyclic coordinate — which means that the conjugate 
coordinate energy E represents a constant of motion. 

The generating function Pj,t,E) that defines the desired mapping of the 

Hamiltonian (21.101) into (21.103) turned out to be 


^2 (qj,Pj,t,E) 



l e rn\m 

4 U(0 


- E 


f 1 dr 
'o £(*)' 


m 


E«< 2 


(21.104) 


Herein, § (t) denotes an as yet undetermined , hence a priori arbitrary differentiable 
function of time. According to the general transformation rules (21.65) for generating 
functions of type Ti, the particular rules for the transformation of canonical coordi¬ 
nates and time follow as 



o \/ga 

- ^/)v / e 7 7? )\ p i) 


f 


dz 


(21.105) 


As the new spatial coordinates Qi and the new time T depend on the old spatial 
coordinates qt and the old time t , respectively, we are actually dealing with a point 
transformation. The transformation of the time scales of both systems is governed by 
the as yet undetermined time function %(t). 

In terms of the new coordinates, the transformation rule for the energy e = 
—dJ~ 2 /dt is found from our T 2 as 


E=$e+ l -{ft;-k)Y i QiPi + \ & -P+m - is 1 - ff) E Ql 

i i 

(21.106) 

Because of dT/dt = 1 /§(*), the relation of old and new Hamiltonians, H and H', 
follows from the general rule from (21.62), yielding 

H' -E = %(t)(H -e). 

With H the original Hamiltonian from (21.101), we get the new Hamiltonian 
H'(Qj, Pj,T) by eliminating the old variables according to the transformation 
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rules (21.105) and (21.106) 

1 n 1 n 


1 = 1 


i—\ 


1 tt ^ A2 . s-2 2 1 s-2 f2 1 s-2 f 

2 $$ ~4 $ +$ M ~4 $ f ~2 ? f 


(21.107) 


This is obviously the desired Hamiltonian H f of the ordinary harmonic oscilla¬ 
tor (21.103), provided that its constant coefficient Q 2 is identified with the terms in 
brackets of the transformed Hamiltonian (21.107) 

n 2 = ~ + “ \ fl ~ = const (2L108) 

Correspondingly, the transformed Hamiltonian H' does not explicitly depend on time 
if and only if dQ 2 /dt = 0, which means that the third-order equation 

i + i(4« 2 - 2/ - f 2 ) +i;(4axb -/-//)= 0 (21.109) 

must be satisfied. As a consequence of this requirement, depending on the given exter¬ 
nal functions oo 2 (t) and f(t) , the function § (t) is now determined. This means, further¬ 
more, that the particular correlations of canonical coordinates, energy, and time scales 
of both systems are now pinpointed according to the transformation rules (21.105) 
and (21.106). With ^(0 a solution of the linear and homogeneous third-order equa¬ 
tion (21.109), then Q 2 = const., and hence the value E = E( 0) of H' constitutes a 
constant of motion. Thus, it is the freedom associated with extended canonical trans¬ 
formations to arbitrarily adjust the time scales of the involved systems that enables us 
to free the target system H f from its explicit dependence on the independent variable. 

If we express the constant value of the new Hamiltonian H'(Qj, Pj) = E = const, 
in terms of the old coordinates pi and qi , then we isolate an invariant of the original 
system (21.101), 

E = \ e ~ F $ EjpI - -?/) X^iPi 

i= 1 i=l 

+ V (,) (! -£/-$/ + 2(21-110) 

i= 1 

The time function § (, t ) can be attributed a physical meaning. We easily convince us 
that 

n 

HO = e f(0 X>? ( 0 (2i.ni) 

i= 1 


represents a solution of (21.109), provided, of course, that all qi(t) are solutions of 
the equation of motion (21.102) of the time-dependent damped harmonic oscillator. 
Inserting (21.111) into the representation (21.110) of the invariant E , then the latter 
takes on the equivalent form 

(21.H2) 

i i i ij 

The actual invariance of E can be proved directly by calculating its time derivative. 
Obviously, the invariant (21.112) of the time-dependent damped harmonic oscilla¬ 
tor (21.102) has exactly the form of the conservation law for the angular momentum 
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in central force fields. In the realm of accelerator physics, the quantity £ rms = \fE/n 
is referred to as the “root-mean-square (rms) emittance.” The “rms-emittance” of a 
charged particle beam is thus invariant along the beam axis, as long as (i) the particle 
motion may approximately be described by linear equations of motion and (ii) the 
number of beam particles is maintained. 

Inserting (21.111) into (21.108) we finally find that the invariant E coincides with 
the coefficient Q 2 from the transformed Hamiltonian (21.103) provided that %(t) is 
given by (21.111), 

Q} = E. (21.113) 


Having related the time-dependent damped harmonic oscillator (21.101) by means of 
an extended canonical transformation with the ordinary (time-independent) harmonic 
oscillator (21.103), we may now explicitly work out the solution functions qi (t) and 
Pi(t) of the equations of motion (21.102). The explicit solutions Qi(T) and Pi(T) of 
the ordinary harmonic oscillator are known from Example 19.3 

(Qi(T)\ ( co&SiT Q- l sin£2T\ 

\-CisinCiT cos CiT J\Pi(0)J' 1 ' ' 


The solution functions qi(T) and pt(T) of the time-dependent damped harmonic os¬ 
cillator follow subsequently as the product of the solution (21.114) with the transfor¬ 
mation (21.105), 


qi(T) \ = ( . ^ 0 \/ cosQT Q ^in^rX 

Pi(T)J ~ y±(£ - %f)yJe F /§ ) V-^sin^T cos^T ) 


x 


/ Qi ( 0 )\ 
\Pi(P)J' 


(21.115) 


The initial conditions Qi(0) and Pi (0) can, furthermore, be expressed through the 
qi( 0) and the pi( 0) by means of the inverse transformation of (21.105) at time t = 
T = 0, 


(Qi ( 0 )\ = / _ o \ fq, ( 0 )\ 

\Pi(o)J \-i(|(o) -t(o) ) U(°)/' 

(21.116) 

We switch to the time scale of the original system by expressing T in terms of t 
according to the corresponding transformation rule from (21.105). The solution of the 
time-dependent damped harmonic oscillator from (21.102) is thus finally given by 





T(t) = 


dr 

(o 


(21.117) 


Herein, %(t) denotes the uniquely determined solution of (21.108) for given initial 
conditions §(0), §(0) and a fixed Q 2 = const. The elements of the solution matrix 
R(t ) are obtained by multiplying the three matrices involved, 


ru(t) = 


iHOe 


F( 0 ) 


%(0)e F (0 _ 


cos 


sin^T(0 


QT(t)- 1(|(0) -/(0)$(0)) ^ 
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rn(t) = 
^21 (0 = 


mm sin QT(t) 


e F{ 0 ) e F(t) 
le F (0) e F(t) 


Q 


(21.118) 


- (I (f) - f(t) (0) + /(0) $(0)) cos SI Tit) 


V mm L2 
- {2(#(°) - mm)(m - mm) + 


sin SIT it) 


mit) = 


me p V 


me 


F( 0 ) 


cos sit( t) + -(m-mm) 


Si 

sin QT(t)' 
Q 


For all times t, the determinant D = r\\r 22 — ^ 12^21 of matrix R(t) has the value 
D = 1. The linear mapping (g(0),/?(0)) i-> ( q(t), p(t )) is thus in agreement with 
the requirement of Liouville’s theorem. For the particular case Q 0, we find that 
(sin QT)/Q -> T. In that case, the particle motion in a time-dependent damped har¬ 
monic oscillator are mapped into free particles. 

The rule for the transformation of time t —> T that emerges from the generating 
function (21.104) has the particular property that the transformed time T does not de¬ 
pend on the coordinates of the particles. Exactly for that reason, the transformed time 
T maintains the property of the original time t to be a common coordinate for all parti¬ 
cles in the transformed system. This means that T may serve as the common evolution 
parameter of all particle coordinates Qi(T) and Pi(T) in the transformed system. In 
other words, T has the global property of the transformed system’s time. Mathemat¬ 
ically, T has this property due to the fact that the (Pi , q{) and the (E,t) terms in the 
particular generating function (21.104) are additive. Therefore, this extended canon¬ 
ical transformation can be split into a conventional canonical transformation plus a 
pure canonical time-energy transformation from Example 21.11. This is not always 
possible as extended canonical transformations can be defined that do not admit a sep¬ 
aration of the transformation of space and time coordinates. We will encounter such 
a canonical transformation in Example 21.18, where the Lorentz transformation is 
formulated as an extended canonical transformation. 


EXAMPLE 


21.17 Galilei Transformation 

Galileo’s principle of relativity constituted until its absorption as a limiting case into 
Einstein’s principle in the year 1905 the most undoubted principles of classical dy¬ 
namics. It stated that there exists an “absolute time” t that is instantaneously common 
to all coordinate systems, how distant apart these systems ever may be located. If we 
consider the special case of two coordinate systems that are moving with respect to 
each other along one coordinate axis at a constant velocity v, the transformation rule 
for the positions q, Q and times t,T of a moving body of mass mo between these two 
systems is simply 

q = Q + vt, t = T. 

Formulated as an extended canonical transformation, the generating function of type 
T 2 of the Galilei transformation is then 


JF 2 (g, P , t, E) = Pq — Et — v(Pt — m$q). 


(21.119) 
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The complete set of transformation rules of the canonical coordinates is then 


9JT 2 9JF 2 

p=—— = P + m 0 v, e = —— 
dq dt 

n 9^2 9^2 

dP H dE 


= E + vP, 


u^ri / 

T = - -=t, H =Hi. 

Q £7 ’ 1 1 


( 21 . 120 ) 


From the general transformation rule for extended Hamiltonians, H[ = H\ , the rule 
for the conventional Hamiltonians H f and H is then obtained according to (21.62) 
with 3 T/dt = 1 as 

H = H' + vP. (21.121) 


As required, the rule for the Hamiltonians is in agreement with the rule for their values, 
e and E. 


Example 21.17 


EXAMPLE 


21.18 Lorentz Transformation 


The correct transformation rule between coordinate systems that move with respect 
to each other at constant velocity (“inertial systems”) is based on the finding that the 
velocity of light, c, is actually finite. This rule is referred to as the “Lorentz transfor¬ 
mation.” A finite c constituting the upper speed limit for any signal obviously means 
that a finite time span is needed for the signal to pass from one reference system to 
another. As an immediate consequence, Galileo’s concept of an “absolute time” that 
is instantaneously common to all inertial systems had to be abandoned. Instead, it is 
obviously necessary to also transform the time t if one performs the transition from 
one inertial system to another. 

The special principle of relativity requires that the formulation of the description 
of a physical system must be the same in all inertial systems. This means in particu¬ 
lar for Hamiltonian systems that the Lorentz transformation must maintain the form 
of the Hamiltonian. On the other hand, as we know from the preceding Chap. 19 
and Sect. 21.3, only canonical transformations maintain the form of the canonical 
equations. We conclude that the Lorentz transformation must constitute a particular 
canonical transformation. As the Lorentz transformation necessarily associated with a 
transformation of the time coordinate, t T, it may be described only in terms of an 
extended canonical transformation. Its extended generating function Ti is given by 


2, P,t, E) = y 


Pq — Et — v\ Pt - 


( 21 . 122 ) 


with v denoting the constant relative velocity of the respective inertial systems. In the 
formulation given here, the coordinate systems are adjusted to ensure that the relative 
motion of both systems occurs along one coordinate axis, q. As usual, we denote by 
y the dimensionless length and time scaling factor y = 1/y/l — /3 2 , with ft = v/c 
the scaled relative velocity. We observe that the generating function (21.122) of the 
Lorentz transformation merges into that of the Galilei transformation (21.119) from 
Example 21.17 for i; «c, hence for f 0, y 1, E/c 2 = m mo. Namely, the 
total mass m = E/c 2 in (21 . 122) is replaced by the constant rest mass mo in the case of 
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the Galilei transformation. With regard to the transformation rules that emerge from 
the generating functions, it is exactly the replacement of the second ^-dependent term 
in (21 . 122) by the constant mass term mo in (21 .1 19) from Example 21.17 that induces 
the time transformation rule t = T of the Galilei transformation. 

For the generating function (21.122), the general transformation rules (21.65) for 
extended canonical transformations yield the particular rules 


P 

Q 


dT 2 
d q 

JF 


= r 



= y(q - vt), 


e 


8T 2 

dt 


= y(E + vP), 



(21.123) 

H[ = Hi. 


The transformation rules for the variables Q and T follow as 





(21.124) 


The canonical transformation approach to represent the Lorentz transformation en¬ 
sures that we simultaneously obtain the rules for conjugate coordinates, P and E , 









(21.125) 


With the real angle a = arcosh y = arsinh/1]/, the linear transformations (21.124) and 
(21.125) can be rewritten as orthogonal mappings, hence as the imaginary rotations 



( cos iot 
— sin iot 


Sima 
cos iot 




( cos ia 
— sin ia 


sin ia 
cos ia 



Obviously, these transformation maintain the “distances” 

_ C 2 T 2 = q 2 _ c 2 t 2 , p 2 _ ^2 = p 2 _ * 2 /c 2 . 


(21.126) 


(21.127) 


With dT/dt = y, the transformation rule for conventional Hamiltonians, H r and H , 
under Lorentz transformations follows according to (21.62) as 

(H' -E)y = H -e. (21.128) 


Together with the rule (21.125) for the transformation of the energies e , E 

e = yE + /3yPc (21.129) 

the transformation rule for Hamiltonians under Lorentz transformations is finally 
found from (21.128) as 

H = yH f + PyPc. (21.130) 

As expected, the Hamiltonians H, H' transform equally as their respective 
values, e, E. 
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As the Lorentz transformation constitutes an extended canonical transformation, 
the extended version of Liouville’s theorem from Example 21.12 applies, 


d(Q,P,T , E ) _i 
d(q,p,t,e ) 


(21.131) 


On the basis of the transformation rules (21.124) and (21.125), we easily verify that 
the requirement (21.131) is indeed fulfilled 


d(Q,P,T, E) 
d(q,p,t,e ) 


9(8, T) d{P,E) 
d(q,t) d(p,e) 



-cfiy 


Y 


Y 


-cfiy 


zPy 

c 

y 


[y 2 (i-P 2 )] 2 = i. 


Yet, we must be aware of the fact that the forms dq dp and dt de of the projection 
planes ( q , p) and ( t , e) are not invariant under Lorentz transformations 


m,p) = 2 we g) = 2 
3 (?, p) K ’ 3 (r,e) K 


(21.132) 


Thus, under Lorentz transformations, the conventional form of Liouville’s theorem 
from Example 19.6 does not hold! 


Example 21.18 


EXAMPLE 


21.19 Infinitesimal Canonical Transformations, Generalized Noether Theorem 


We define the following generating function of an infinitesimal extended canonical 
transformation that generalizes the infinitesimal time-step transformation from Exam¬ 
ple 19.5, 


J~2(qvi Pv) — 'y ' qn p n ^ I (qv, Pv)- 

li= o 


(21.133) 


Herein, € denotes the infinitesimal parameter and I(q v , p v ) a differentiable function 
of all variables of the extended phase space (/x, v = 0,..., n). The coordinate trans¬ 
formation rules follow as 


3^2 3/ 

Pfi ~ 3 - — P/i + 6 7 , -» 

9^/x 3 q^ 

_ df 2 _ 3/ 

QlX ~ 


(21.134) 


To first order in 6, we derive the variations 8 p M and Sq^ from the transformation 
rules (21.134), 


$PfM = E/i - Pii = 


3/ 

Hu' 


H/jl — Q/jl q /x 



(21.135) 
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The total s-derivative of I (q v , p v ) is then 
dl x—v/ 9/ dq /x 9/ dp 


Cl 1 ^ ^/ (7 x CLCfjj^ U1 ClfJ 

ds dq M ds dp^ ds 


— / I &Pii c, T- 9^ I 

rv Hu) 


/X 

= -9/fi. 


(21.136) 


Thus, by means of the canonical equations (21.23) and the first-order transformation 
rules (21.135), we have found that the characteristic function I(q v , p v ) that is con¬ 
tained in the generating function (21.133) constitutes a constant of motion exactly if 
the extended Hamiltonian H\ is invariant under the transformation (21.135) generated 
by (21.133). In other words, if the rules (21.135) define a symmetry transformation 
of the given Hamiltonian system then the characteristic function I (q v , p v ) of the gen¬ 
erating function (21.133) constitutes a constant of motion. The correlation (21.136) 
of a Hamiltonian system’s invariants I to the symmetry transformations that main¬ 
tain the value of its extended Hamiltonian H\ establishes Noether’s theorem 1 of point 
mechanics in the extended Hamiltonian formalism 

— =0 «5ffi=0. (21.137) 

ds 

The derivation of Noether’s theorem in the context of the Lagrangian formalism 2 is 
restricted to extended point transformations (see Example 21.10). Yet, the extended 
canonical transformation approach allows to describe more general possible symme¬ 
try mappings as the rules (21.134) are not restricted to point transformations. Con¬ 
sequently, (21.137) in conjunction with the infinitesimal canonical mapping (21.135) 
represents a generalized formulation of Noether’s theorem. 

The variation of an arbitrary function u(q v , p v ) under the infinitesimal canonical 
transformation that is induced by I(q v , p v ) according to the rules from (21.135) is 
then 


1 Amalie “Emmy ” Noether, German mathematician, b. March 2, 1882, Erlangen, Germany-d. April 
14, 1935, Bryn Mawr, Pennsylvania, USA. Emmy Noether grew up in Erlangen in the family of a 
mathematician and passed the state examination for teachers of foreign languages. When in 1903, 
women were for the first time allowed to study at Bavarian universities, she took up studies of mathe¬ 
matics in Erlangen and graduated in 1907. Following an invitation by Felix Klein and David Hilbert, 
she then moved to Gottingen. She was not allowed a habilitation, the German qualification to become 
a professor, until 1919, and only in 1922 she became an assistant professor, and the first paid job as 
a professor in 1923. In 1928/29 she was a guest professor in Moscow, and on 1930 in Frankfurt am 
Main. Because of her Jewish descent and her political views, she was forbidden to teach in 1933. 
Noether emigrated to the US, where she found a position as a guest professor at the Bryn Mawr 
Women’s College. 

Emmy Noether’s work on the theory of invariants, the theory of ideals and of rings and modules 
was instrumental to the development of modern algebra. Her theorem linking continuous symmetries 
of a physical system to conserved quantities is one of comer-store concepts of modern physics. 

2 In the original publication of Emmy Noether (“Invariante Variationsprobleme,” Nachr. Kgl. Ges. 
Wiss. Gottingen, Math.-Phys. Kl. 1918, 235), the theorem was presented for continuous systems in 
the Lagrangian formalism. 
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E ( du du \ 

a- Hfi + 7- S Pfi 

/t \Hn dPn J 

\ > / 9w 9/ 9w 9/ 
a ^9^ 9^ dp^dqn 


(21.138) 


In the notation of extended Poisson brackets that was introduced in Example 21.13, 
we may express the variation 8u concisely as 


8u = € [u, /] e . 


(21.139) 


As (21.139) applies for arbitrary differentiable functions u = u(q v , p v ), we may 
equivalently write it as an operator equation, 

8u = eU u. 


Example 21.19 


The operator U is thus given by 


U = [.Jf 


yY 9/ 9 

Bc tn 


9/ a 

Hu Hu 


(21.140) 


The dot in the Poisson bracket expression stands here as a placeholder for a function 
the operator U acts on. We refer to U as the generator of the infinitesimal symmetry 
transformation (21.135) that is associated with the invariant I. Obviously, I itself is 
invariant under the symmetry transformation (21.135) which it generates 


81 = eUI = €[/, /] e = 0. (21.141) 

Two invariants I\ and I2 of a given Hamiltonian system H then define the two sym¬ 
metry operators 

#i = [.,/i] e , £>2 = [.,/ 2 ] e . (21.142) 

The concatenations of the operators U\ and U2 can then be expressed as nested Pois¬ 
son brackets. Skipping the superscript e , this reads 


UiU 2 = U l [.,I 2 ] = [l.,l 2 lhl U 2 U l =U 2 [.,h] = [[.,hll 2 l (21.143) 

The commutator {U\,U 2 }_ of the operators U\ and U 2 then defines a generally not 
vanishing operator f/ 3 , which is represented in terms of extended Poisson brackets as 

U3 — \U\ ^ u 2 \ — u\ u 2 — U 2 U] 

- [[•, hi /1] - [[ •, /1], h] - ~[h, [., hi] - [h, [/I,.]] 

= [.,[/2,/l]] 

= [.,/ 3 ], with / 3 = [/ 2 ,/i]. (21.144) 

According to Poisson’s theorem from (19.32), the function I 3 = [I 2 ,1 1 ] represents 
another invariant of the respective Hamiltonian system. Consequently, the operator 
U 3 equally defines a symmetry transformation of the system. The commutator of two 
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generators of symmetry transformations thus provides another generator of a symme¬ 
try transformation. The group of generators of symmetry transformations of a Hamil¬ 
tonian system thus forms in conjunction with the commutator operation a Lie-Poisson 
algebra. For a given Hamiltonian system, we do not know a priori whether all its in¬ 
variants have been found, and hence whether the Lie-Poisson algebra of symmetry 
operators is complete. Yet, by applying Poisson’s theorem to all pairs of known invari¬ 
ants, it is always possible to find a subset of invariants Ij that is closed with respect 
to evaluating their mutual Poisson brackets. With respect to the set of generators of 
symmetry transformations we thus find a sub-algebra for pertaining symmetry opera¬ 
tors Uj. 


EXAMPLE 


21.20 Infinitesimal Point Transformations, Conventional Noether Theorem 


As we are dealing with an infinitesimal transformation, we may eliminate the original 
coordinate from the generating function (21.133) of Example 21.19. Solving for /, 
we rewrite this equation to first order in € as 


e i (q v , Pv) = Ti(q v , 



9/ 

*9^ 


— ftiqv, Pv) y ' QnPn +£ y ' P/I „ 

u—0 II—0 Pf 1 


— (#v> Qv) H” £ 'y ' Pii 

!i=0 


dl 

dPu' 


In the last step, we have replaced the generating function of type (q v , Pv) by an 
equivalent function of type T\ (q v , Q v ) according to (21.63). In our case of an infini¬ 
tesimal transformation, the generating function T\ may alternatively be expressed to 
first order in e as 


Fliqv, Qv) =F\(q v Jqv) = € f(q v ). 

The function I (q v , p v ) may thus be expressed as 
^ d I 

I{q v ,Pv) = Y J P^— + f{qv). (21.145) 

U d P 

This equation is obviously fulfilled for functions I (q v , p v ) that are linear in the p v 

n 

H.q v , Pv) = -^2 PuP^iqv) + f(q v )- (21.146) 

/i=0 

The functions rj^(q v ) in (21.146) are defined to only depend on the extended set of 
canonical variables q v , hence on the configuration space variables qj and time t in the 
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conventional description. With this I(q v , p v ), the generating function (21.133) from 
Example 21.19 defines the extended point transformation 

6/z =qfi -erjpiqy), n = 0 (21.147) 

We may write (21.146) equivalently in the conventional description by replacing the 
canonical variable po = —e/c according to (21.20) with the Hamiltonian H(qj , Pj,t). 
Furthermore, the system evolution parameter s must be replaced with the physical time 
t as the independent variable. With § = rjo/c , the special Noether invariant (21.146) 
then writes 

n 

i = %(qj,t) h - 7(21.148) 

i =1 

This defines the conventional Noether invariant of point mechanics. If we can find 
functions §(<?;, t), qt(qj, t ), and f(qj , t) for a given Hamiltonian H(qj , pj , t) that 
satisfy dl/dt = 0, then I from (21.148) constitutes a constant of motion. The Hamil¬ 
tonian form of Noether’s theorem presented here then states that the corresponding 
symmetry of the given Hamiltonian system is given by an extended canonical point 
transformation that is determined by those functions § ( qj , t) and rn (< qj , t) 

T = t -€^(q j9 t), Qi =q t -eiyiqj, t). (21.149) 

As a result of the fact that the Noether theorem from (21 . 148) represents only a special 
case, not all invariants of a given system can be expressed by (21.148) in general. As 
we see from the symmetry transformations (21.149) that are associated with invariants 
of the form (21.148), only those symmetries that are represented by extended point 
transformations are covered by the conventional Noether theorem. 

Invariants that are not of the form of (21.148) are commonly referred to in liter¬ 
ature as “non-Noether invariants,” an example of which we will encounter in Exam¬ 
ple 21.21. The symmetry transformations associated with “non-Noether invariants” 
do not constitute extended point transformations, and hence do not emerge straight¬ 
forwardly in the context of the Lagrangian formalism. 


EXAMPLE 


21.21 Runge-Lenz Vector of the Plane Kepler System as a Generalized Noether 
Invariant 

The classical Kepler system is an example of a two-body problem whose masses in¬ 
teract according to an inverse square force law. In its plane version, the Hamiltonian 
of this system writes in scaled coordinates (see also Example 19.9) 

1 9 1 9 K 

H=~p\+~p\- i - (21.150) 

2 2 

The canonical equations follow as 

qi=Pi, pj=-K—J± - , f = 1,2. (21.151) 

fiq i +<?f ) 3 


Example 21.20 
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As the characteristic function I = I (q\, q 2 , pu P2, 0 that is contained in the gen¬ 
erating function for an infinitesimal canonical transformation (21.133) from Exam¬ 
ple 21.19, we define 

I = -PiP 2 r]i(qi,q 2 ,t) - pjr] 2 (qi,q 2 ,t) + (21.152) 


The as yet unknown coefficients q\ (q \, q 2 , t), r) 2 (q \, qi, t), and f(q\,q 2 ,t) contained 
herein must now be determined accordingly to render I a constant of motion. With the 
physical time t the system’s independent variable, the condition therefore writes 


-r [~p\P2 m(qi,q2,t)~ p 2 miqi ,qi, t) + f(qi ,q 2 , t )] = o. 


(21.153) 


Inserting the canonical equations, we then find 
K 


y +«i ) 3 


(q\P2m +q2Piqi + 2 q 2 P 2 q 2 ) - p\P2 


( 3 m , dqi 3 m \ 


9 / 3?72 9^2 3?72 \ 9 / 3 / 3 / 

— pi (-1- p\ H- P 2 ) -1- Pi H- P 2 = 0. (21.154) 

F2 \dt 3qi dq 2 ) 3 1 dqi 3 q 2 


As this polynomial in the pj must be satisfies not only in one instant of time to but 
for all times t , we conclude that the coefficients of each power p^p™, n,m = 0,..., 3 
must vanish separately. We thus obtain here eight separate conditions, 


dip 

dqi 


= 0 , 


K 


9y?2 

9<?2 


= 0 , 


7 (4i 2 + q f) 3 


3/ 

? 2»?1 + 7 — 
3<7i 


3?n 9^72 = o 9^ =0 3^ =0 2^ = o 

3<?2 3<?i ’3? 3f 3f ’ 

K df K ' ’ 

i =(qim + 2<?2 m) + a— = o. 

M+ti 3 qi 


From the conditions in the upper line, the following particular solutions emerge 


qi=q2, m = -qi, f = f(q\,qi)- 


(21.156) 


We may now easily convince ourselves that the function 

/(<!.«) = -g , = (21.157) 

(/«?+«! 

is a solution of both conditions from the lower line. This shows that an invariant of 
the form of (21.152) exists. Inserting the solutions rj\, r]2, and / into (21.152) finally 
yields 

/RLi = ~P\P2q2 + p 2 qi - k —f=^=- (21.158) 

This constant of motion represents one component of the Runge-Lenz vector, which is 
referred to in literature of a “non-Noether invariant.” Due to its quadratic momentum 
terms, the invariant (21.158) cannot be expressed as a conventional Noether invariant 
of the form of (21.148) from Example 21.20. By systematically defining appropriate 
polynomial functions I (q v , p v ) of the p v , we may construct the invariants of Hamil¬ 
tonian systems from the solutions of systems of coupled partial differential equations. 













21.3 Extended Canonical Transformations 


453 


The symmetry transformation that is associated with the invariant Irr { follows 
from the rules (21.135) of Example 21.19 


<5<?i = -ep2q2, 

Sq 2 = e(2p 2 q\ - pm), 


Sp l 


- € [p 2 ~ 


K 


q\ 


/(q[+~qf^^ 


8 p 2 = e [p\p 2 - K 


qm 


yjiql + 4f) 3 ' 


As the variations of the spatial coordinates depend on the canonical momenta, 
this transformation is not a point transformation—and thus represents a generalized 
Noether symmetry. We finally get as the generator Ur = [., /rli ] of the pertaining 
symmetry transformation of the system 


9 9 

URU = -piqi- -h (2p 2 qi - piqi) — 

9 <71 9<72 


-\P2-K 


q\ 




+ \PlP2 - K 


qm 


y/iqi+qi') 


3/9/12 


Example 21.21 












Extended Hamilton-Jacobi Equation 


22 


In the context of the extended canonical transformation theory, we may derive an 
extended version of the Hamilton-Jacobi equation. We are looking for a generating 
function Qv , Pv) of an extended canonical transformation that maps a given ex¬ 
tended Hamiltonian H\ = 0 into a transformed extended Hamiltonian that vanishes 
identically , H[ = 0, in the sense that all partial derivatives of H^(Q V , P v ) vanish. 
Then, according to the extended set of canonical equations (21.23), the derivatives of 
all canonical variables Q^(s), P^is) with respect to the system’s evolution parameter 
s must vanish as well 


djn =0= do E 

3 Pn ds 


3 Hi dPu 

-- = 0 = — 

dQv ds 


li = 0 ,..., n. 


( 22 . 1 ) 


This means that all transformed canonical variables 2/z, Pfi must be constants of mo¬ 
tion. Writing the variables for the index fi = 0 separately, we thus have 


T —oi o = const., Qi ~ oii = const., E = —/3q = const., 


Pi = Pi = const. 


Thus, corresponding to the conventional Hamilton-Jacobi formalism, the transformed 
canonical variables, Qj and Pj, are constants. Yet, in the extended formalism, the 
transformed time T is also a constant. The particular generating function Pv) 

that defines transformation rules for the extended set of canonical variables such that 
(22.1) hold for the transformed variables thus defines a mapping of the entire system 
into its state at a fixed instant of time, hence—up to trivial shifts in the origin of the 
time scale—into its initial state at T — t (0) 

T = m, Qi=qt( 0), Pi = Pi ( 0), E = H(qj(P),pj(0),t(0)). 


We may refer to this particular generating function as the extended action function 
J~2 = S\(q v , Py). According to the transformation rule H[ = H\ for extended Hamil¬ 
tonians from (21.59), we obtain the transformed extended Hamiltonian H[ = 0 sim¬ 
ply by expressing the original extended Hamiltonian H\ = 0 in terms of the trans¬ 
formed variables. This means for the conventional Hamiltonian H(qj , pj, t) accord¬ 
ing to (21.25) in conjunction with the transformation rules from (21.65), 




W. Greiner, Classical Mechanics , 

DOI 10.1007/978-3-642-03434-3_22, © Springer-Verlag Berlin Heidelberg 2010 


455 









456 


22 Extended Hamilton-Jacobi Equation 


As we have ds/dt ^ 0 in general, we finally get the generalized form of the Hamilton- 
Jacobi equation, 


H 



ds i ds i 

’ ” ’ ’ dq n 


a Si 

+ — =o. 

dt 


( 22 . 2 ) 


Equation (22.2) has exactly th z form of the conventional Hamilton-Jacobi equation. 
Yet, it is actually a generalization as the extended action function Si represents an 
extended generating function of type Ti, as defined by (21.63). This means that Si is 
also a function of the (constant) transformed energy E = —cPo(0) = —fo. 

Summarizing, the extended Hamilton-Jacobi equation may be interpreted as defin¬ 
ing the mapping of all canonical coordinates qj, Pj,t, and e of the actual system into 
constants Qj, Pj, T, and E. In other words, it defines the mapping of the entire dy¬ 
namical system from its actual state at time t into its state at a fixed instant of time, T, 
which could be the initial conditions. 


EXAMPLE 


22.1 Time Dependent Harmonic Oscillator 


As a simple example for the method to analyze an explicitly time-dependent Hamil¬ 
tonian system by means of the generalized Hamilton-Jacobi equation we choose the 
time-dependent harmonic oscillator 

l n i n 

H(qj, pj,t) = 2 Y Pi + -® 2 (0 ( 22 - 3 ) 

i=l i=1 


With Si = S\(q \,..., q n ,t, P \,..., P n , E) an extended action function, we encounter 
the following Hamilton-Jacobi equation (22.2) for this system 


8 S i 

~di 


+ 


5 E 


3SiV 

3 4i) 


1 , 
+ 2 - 


V)y> 2 =°. 


(22.4) 


The problem is now to find a solution Si for this nonlinear partial differential equa¬ 
tion. We start with generating function of the extended canonical transformation from 
Example 21.16, and restrict ourselves for simplicity to the case of zero damping, 
{F{t) = 0), 


Si (qj,t 




dr 

HO' 


(22.5) 


In the first instance, we require only the transformed energy E = —cPo(O) = —fio 
in (22.5) to represent a constant. We insert the partial derivatives of S) with respect to 
Pi , qi , and time t 


q. _ _ 9 i_ 

Ui ~dPi~Zfl 


pi = 


9 Si 
3 qi 


Pi 

V? 2^' 



1 § „ 1 

'2VF i + 4 



E 

J 


( 22 . 6 ) 





22 Extended Hamilton-Jacobi Equation 


457 


into the Hamilton-Jacobi equation (22.4). Expressed in terms of the transformed co¬ 
ordinates, we find 



1 

2 


i 


= 0 . 


(22.7) 


In this equation, the sum of all terms proportional to J2i Qf was denoted by Q 2 , 

For the required constant transformed energy E , (22.7) can only be satisfied for all t 
if Q 2 itself constitutes a constant of motion 

+ $ 2 co 2 (t) = tt 2 = const. (22.8) 

With %(t) a solution of (22.8), the transformation rules for the coordinates are now 
uniquely determined 


Qi = 


<k_ 

vr 


Pi = 4ipi 


i k 


(22.9) 


As the action function Si = S\{qj,t, Pj, E) constitutes an extended generating func¬ 
tion, the corresponding transformation maps the original time t into a new time T , in 
conjunction with the usual mapping of the canonical coordinates, Qj{T) and Pj(T). 
The correlation of new time T and original time t is determined by the solution § (t) 
of (22.8) 


Example 22.1 


T(t) = — 


dSi 

Je 


r f dr 
'o §00’ 


( 22 . 10 ) 


The transformed Hamiltonian H'(Qj, Pj, T) follows now from the general transfor¬ 
mation rule (21.62) for extended generating functions T with dT/dt = l/%. 


H' — E — e) 


( 22 . 11 ) 


which reads explicitly in the new coordinates Qi , Pi 

1 n \ U 

H\Q j ,P j )=-Y J P? + ^ 2 Y J Ql (22.12) 

i =1 i =1 

This Hamiltonian H r does not explicitly depend on T if and only if £(0 represents a 
solution of (22.8) with constant Q 2 . In this case, H r corresponds to the Hamiltonian 
of the ordinary (time-independent) harmonic oscillator. As required, the value of H', 
given by E = — fio , is now a constant of motion. In contrast, the canonical coordinates 
{Qi, Pi) of this system are not constant. The given task to map the time-dependent 
harmonic oscillator (22.3) into a system where all transformed canonical coordinates 
, P/x depict constants of motion is thus as yet performed for fio only. However, 
we may now in a second step transform the Hamiltonian (22.12) into a new Hamil¬ 
tonian that vanishes identically, H" = 0. To this end, we first set up the corresponding 
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Example 22.1 


Hamilton-Jacobi equation, 



(22.13) 


We now try to find a solution S = S(Qj,T, /3j) that depends on the constant trans¬ 
formed momenta fij = const. We may here restrict ourselves to a conventional action 
function S as the Hamiltonian (22.12) does not explicitly depend on the independent 
variable, i.e. the transformed time T . Due to the quadratic dependence of the Hamil¬ 
tonian (22.12) on the Qi and the Pi, we try to determine the solution S(Qj,T, fij) 
of (22.13) by defining 

5 = \ a(T) E Q] + E Gift + \ c ^ E A 2 - (22.14) 

i i i 

Inserting the partial derivatives 

21.=aQ i +bp i , ^ = ^Es 2 + ^EG;ft + ^Eft 2 (2Z15) 

z i i i 

into the actual Hamilton-Jacobi equation (22.13) yields 

\(a + a 2 + ^ 2 ) E Qi + ( b + ab ) E Gift + \(c + ^ 2 ) E A 2 = 0- (22.1H5) 

i i i 


This equation can only be satisfied for arbitrary Qi at all times T if the coefficients 
in (22.16) vanish separately 

a + a 1 + Q 2 = 0, b + ab = 0, c + b 2 = 0. (22.17) 


Starting with the leftmost non-linear first-order differential equation for a (T), we may 
solve this coupled set step-by-step 

1 tan£ 2 T 

a(T) = —^tan QT, b(T) = -—, c(T) =-—. (22.18) 

cos QT £2 

We thus find the following action function S(Qj,T, fij) as the solution of the 
Hamilton-Jacobi equation (22.13) 

1 x^ 9 1 x—> 1 tan Q T ^> 9 

s=--^t'£q} + — Eft 2 - 

i i i 

By means of this action function, we may now work out the relation of the coordinates 
Qi(T ) and Pi(T) with the integration constants and Pi. We thereby show that 
the action function S = F 2 represents exactly the generating function of a canonical 
transformation that map the Hamiltonian H f from (22.12) into a new Hamiltonian 
H" = 0 that vanishes identically. According to the general rules from (19.12), we 
obtain for (22.19) the particular transformation rules 
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H n 


dS 

~~Wj 

ds 

Wj 

dS 


cos QT 


-Pi 


tan QT 


Q 


— Qj £2tan£2T + 


Pj 


cos QT’ 


H' + — = H' - - 


1 Q z 


dT 


2 cos 2 QT 


E 9 Q sin £2 T ^> 

Qi + 2or E Q‘P‘ 

C0S z OT 


( 22 . 20 ) 


1 1 


2 cos£2T 


Ea 2 


Solving for the coordinates Qj and Pj this finally yields 
sin£2T 

Qj = otj cos £2T + pj - 


Q 


Pj = — (Y/^sin^r + Pj cosQT, 

H "= H '-\Y. p ?-\^Y.Q 2 i= °- 


( 22 . 21 ) 


With H" = 0 our task has been accomplished. The representation of the Qj and the Pj 
as functions of the integration constants otj and Pj means to have completely solved 
our system. We may, furthermore, merge the coordinate transformations of the first 
and second step 


'Vj 
\Pj v 



cos^r 


sin£2T 

Q 


aVW) VW)J \-^ sinQT cos QT 
wherein 0 ^ Q = const, and § (t) denotes a solution of 

The transformed time T then follows from 
T 



( 22 . 22 ) 


/' 


dr 

£(*)’ 


(22.23) 


(22.24) 


With the particular initial conditions §(0) = 1 and §(0) = 0, the constants otj, Pj ob¬ 
viously represent the values of the coordinates qj , pj at the instant of time t — 0. 

Both, the extended action function Si (qj, t, Pj, Pq) from (22.5), and the con¬ 
ventional action function S(Qj,T, Pj) from (22.19) define canonical transforma¬ 
tions. The concatenation of both transformations then establishes again a canonical 
transformation. In principle, it is thus possible to find an extended action function 
S[(qj, t, Pj, Po) = Pfj) that yields the solution of (22.4) with all P^ constants, 

which means to solve the problem in one step. 


Example 22.1 















Part V|| 
Nonlinear Dynamics 


The treatment of mechanics in these lectures would not be complete if we did not deal 
at least in brief with a topic which recently has attracted much attention: nonlinear 
dynamics, and thereof the “theory of chaos” as a special topic. 

The starting point is the observation that ordered and regular motions like those 
occurring in the harmonic oscillator, the pendulum, or the Kepler problem of plane¬ 
tary motion are more an exception in nature than the standard case. One frequently 
encounters erratic phenomena and phenomena that are unpredictable in the details. 
A particularly striking example is the occurrence of turbulence in the flow of liquids. 

Toward the end of the nineteenth century, the “father of nonlinear dynamics,” Henri 
Poincare, 1 for the first time pointed out that an irregular behavior in mechanics is not 
at all an unusual feature if the system being studied involves a nonlinear interaction. 
Closely related is the—at first sight astonishing—insight that also very simple systems 
may exhibit a highly complex dynamics. A simple deterministic differential equation 
involving nonlinearities may have solutions the behavior of which over longer time 
periods evolves quite irregularly and practically cannot be predicted. This is one of the 
characteristic features of chaotic systems. The meaning of this concept, which may be 
precisely defined in the frame of nonlinear dynamics, extends far beyond mechanics, 
since the phenomenon of chaos arises in many fields not only of physics but also of 
chemistry, biology, etc. 

In the following sections, we shall learn quite a lot about general properties of 
nonlinear dynamic systems. The time dependence and stability of their solutions will 
be discussed, and concepts like attractors, bifurcations, and chaos will be introduced. 


1 Jules-Henri Poincare , French mathematician and physicist, b. April 29, 1854, Nancy-d. July 17, 
1912, Paris. Poincare studied at the Ecole Polytechnique and the Ecole des Mines and was a scholar of 
Ch. Hermite. Soon after he received his doctorate, he obtained a chair at the Sorbonne in 1881, which 
he held until his death. In pure mathematics, he became famous as the founder of algebraic topology 
and of the theory of analytic functions of several complex variables. Further essential fields of work 
were algebraic geometry and number theory. But Poincare also dealt with applications of mathematics 
to numerous physical problems, e.g., in optics, electrodynamics, telegraphy, and thermodynamics. 
Together with Einstein and Lorentz, he founded the special theory of relativity. Poincare’s work on 
celestial mechanics, in particular, on the three-body problem, culminated in a monograph in three 
volumes (1892-1899). In this context, he was the first who discovered the appearance of chaotic 
orbits in planetary motion. Poincare has been called “the last universalist in mathematics” because of 
the unusually broad scope of his interests. 
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VII Nonlinear Dynamics 


However, a detailed treatment of nonlinear dynamics, its manifold problems, and in¬ 
terdisciplinary applications exceeds the scope of this book. 2 In particular, we cannot 
deal in more detail with the important topic of chaos in Hamiltonian systems. 


2 Some textbooks from the very extensive literature on nonlinear dynamics: 

H. Schuster, Deterministic Chaos, VCH Verlag (1989). 

G. Faust, M. Haase and J. Argyris, Die Erforschung des Chaos, Vieweg (1995). (This was also trans¬ 
lated into English and published as G. Faust and M. Haase, J. Argyris, An Exploration of Chaos, 
North-Holland (1994).) 

H. -O. Peitgen, H. Jurgens and D. Saupe, Chaos and Fractals: New Frontiers of Science, Springer 
(1992). 

R.C. Hilbom, Chaos and Nonlinear Dynamics, Oxford University Press (1994). 

G. Jetschke, Mathematik der Selbstorganisation, Deutscher Verlag der Wiss., Berlin (1989). 
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A unified theoretical description may be given for many of the systems of interest. 
A system is described by a finite set of dynamic variables that will be combined to 
a column vector x = (vi,...,v^) T e The state of the system at a given time t 
is uniquely described by such a point x in phase space. The X[ are generalized co¬ 
ordinates that may represent a variety of quantities. Note that the vector x shall also 
comprise the velocities (or momenta, respectively). We now assume that the system 
behaves deterministically. Thus, the entire time evolution x(t) is determined if an 
initial value x(to) is given. The time evolution shall be described by a differential 
equation of first order with respect to time: 

tLx(t) = F(x(t),r,X). (23.1) 

Here, F is in general a nonlinear function of the coordinates x (also called the ve¬ 
locity field or vector field). Moreover, F may also still explicitly depend on the time 
t , for example if varying external forces are acting on the system. If there is no such 
dependence, the system is called autonomous. Finally, the third argument in (23.1) 
shall indicate that possibly there exist one or several control parameters X. These are 
fixed given constants whose values affect the dynamics of the system and may pos¬ 
sibly change the character of the dynamics. Typical control parameters are, e.g., the 
coupling strength of an interaction, or the amplitude or frequency of an external per¬ 
turbation imposed onto the system. 

Note: A possible explicit time dependence in (23.1) may be eliminated by a simple 
trick. For this purpose we consider a system with one additional degree of freedom, 


X = (x\, . . . , Xn, X;v+l) T G R^ +1 , 


and postulate for the additional vector component the differential equation 
d 

~ r + XN +1 = L 
dt 

With the initial condition x#+i(0) = 0, this simply implies Jt#+i(0 = t. Hence, the 
time on the right side of t may be replaced by x^+i, and we are dealing with an 
autonomous system with one additional dimension. 

The equation of motion (23.1) is very far-reaching, in spite of its simple shape. In 
particular, it incorporates the Hamiltonian mechanics as a special case: For a system 
with N degrees of freedom described by the generalized coordinates q\,... ,q^ and 
the associated canonical momenta p\, ..., pw, the Hamiltonian equations of motion 


W. Greiner, Classical Mechanics , 

DOI 10.1007/978-3-642-03434-3_23, © Springer-Verlag Berlin Heidelberg 2010 
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23 Dynamical Systems 


(see Chap. 18) read as follows: 




(23.2) 


If the coordinates and momenta are combined according to x = (qi,...qN\ 
pi,, pn) T to a 2 N -dimensional vector, then (23.2) may be written as a combined 
matrix equation of the form (23.1): 


— x = J W X H. 
dt 


(23.3) 


V x // stands for the gradient vector of the Hamiltonian function, 


/dH dH dH dH 
dq N ’ dpi'”' dp N 


(23.4) 


and the 2 N x 2N-maXnx J provides both the permutation of the components as well 
as the correct signs: 


J = 




(23.5) 


where I denotes the N x N unit matrix. By the way, J has the following useful prop¬ 
erties: 


J _1 =J T = -J, J 2 = -I, det J = 1. 


(23.6) 


Moreover, dissipative systems may also be described by (23.1) by introducing 
velocity-dependent friction terms; see, e.g., Example 23.2. 

Obviously, the solutions of (23.1) may be highly manifold. For a given starting 
vector x(t = 0) = xo, a trajectory x(t ), also called an orbit , may be calculated (which 
in nonlinear systems, as a rule, is of course not feasible in an analytic manner) the 
mathematical existence and uniqueness of which is guaranteed under very general 
conditions by the theory of differential equations. Of particular interest is the asymp¬ 
totic behavior of the trajectory at large times: Does it reach a stationary state (a fixed 
point ) or a periodic vibration (a limit cycle), or does it behave irregularly? 

The connection between x(t) and xo is mathematically a mapping : R N -> R N , 
namely, 

<D,(xo) = x(f). (23.7) 


This mapping that depends on the time f as a parameter is called the phase flow or 
simply the flow of the vector field F(x). For t = 0, the flow obviously reduces to the 
identical mapping 

*t =o = L (23.8) 


Furthermore, for autonomous (not explicitly time-dependent) systems, we have for a 
subsequent performance of two time shifts 


(23.9) 
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For a thorough understanding of the dynamical system, it is not sufficient to inspect 
individual trajectories. Of much more interest is the behavior of the ensemble of all 
trajectories in phase space. This may be interpreted as a query for the global properties 
of the mapping Important questions are as follows: Can the flow be characterized 
universally “on the large scale”? Are there regions with qualitatively distinct behavior 
(ordered vs. disordered motion)? How does the flow vary with the value of a possibly 
existing control parameter X (are there critical threshold values at which a new type of 
behavior arises)? 

The answer to these questions depends of course on the system being considered. 
Nevertheless, one may find general criteria in the frame of nonlinear dynamics, and 
it turns out that seemingly very distinct systems display amazing similarities in their 
dynamics. 


23.1 Dissipative Systems: Contraction of the Phase-Space Volume 


Conservative systems are characterized by a volume-conserving dynamical flow. Li- 
ouville’s theorem , proved in Chap. 18, states that the volume of a cell in the 2TV- 
dimensional phase space (qi ,..., q^; pi, Pn) does not vary with time if the 
points contained therein are moving according to the Hamiltonian equations. In dissi¬ 
pative systems , on the contrary, the cells in phase space are shrinking with time. We 
shall now derive a quantitative measure of this phenomenon for a general autonomous 
dynamical system, the trajectories of which obey the equation of motion 

2-x(0=F(x(0) (23.10) 

in an A-dimensional phase space. To this end, we consider a small volume element 
A V (x) that at time t = to shall be at the position x = xo and shall move with the flow. 
In Cartesian coordinates, the volume is given by the product of the edge lengths, 

N 

A V(x) = J~[ Axi (x). (23.11) 

i=1 

The time derivative of this quantity is, according to the chain rule, given by 


, N , A / \ N 

a a A Xi (x) 

-av W = £ 


i=1 
N 


dt 


rw*) 


N 


=n ax jwji 


;=i 


i=\ 


1 dAxi(x) 
Axi (x) dt 


=AV(x) 


(23.12) 


where the extension Ax; / A x\ has been added. Hence, the relative change (= logarith¬ 
mic time derivative) of the volume is 


1 d 
A V(x)dt 


AV(x) = 


N 


E 


1 d Axi (x) 

Axi (x) dt 


(23.13) 
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The change of the edge lengths of the volume 1 may be calculated from the equation of 
motion (23.10). Let us consider the distance between two edges of the cube along the 
/-direction which are determined by the trajectories xo(0 with xo(fo) = x 0 and x(t) 
with x(*o) = xo + e; Ajq : 


dAxi 


dt 


= — {xi (t) - XOi ( 0 ) 


= F,(x(to)) - Fj (x 0 (?o)) 

= Fi(x o + e. Ax/) - F/(x 0 ). 


(23.14) 


For small deviations Axi , the Taylor expansion of F(x) yields to first order 


dAxi 


dt 


to 


c m 

dXi 


A Xi. 
xo 


At the point to = t,xo = x (23.13) thus yields 


(23.15) 


A(x) := 


N 


i d A dFi 

-AV(x) = V —- = V • F. 

A V(x) dt ^ dxi 

i—\ 


(23.16) 


The rate of change A of the phase-space volume is therefore determined by the diver¬ 
gence of the velocity field F. 

Liouville’s theorem is included in (23.16) as a special case. According to 
(23.3)-(23.5), the velocity field of a Hamiltonian system with the coordinates x = 
(qi,...,q N ; p\,p N ) T reads 


dH dH dH dH\ l 


F(x) = ( --;- 

\dpi dp N dqi 

This leads to the volume change 


dqN J 


N , N , 

a = v. f = J2 fpt + E ir pN+i 
U 9 * tl d P‘ 

N N 

_ A d dH A d dH _ o 
“ dqt dpi " 3 Pi dqi 


(23.17) 


(23.18) 


which confirms that conservative systems are volume conserving. 

If the flow in phase space is contracting , i.e., if A = V • F < 0, the system is called 
dissipative. This is so far a local statement holding at a point x in phase space. In order 
to get a global estimate of the dynamics, A(x) has to be averaged over a trajectory x(t). 
If A thereby changes sign, then there is no simple method of finding out whether the 
system is dissipative; one actually has to evaluate the mean value. 

In dissipative systems, the volume filled by neighboring trajectories shrinks with 
increasing time; asymptotically it even tends to zero. This may happen in a trivial 
manner if the trajectories are converging. In the simplest case they are moving towards 


1 Strictly speaking, the shape of A V is distorted, and the edges do not remain orthogonal to each 
other. But this is of no meaning when calculating the volume to lowest order. 
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an equilibrium point and the motion comes to rest (see the section on limit cycles). 
There is, however, also the possibility that the volume is shrinking and the distance 
between the trajectories is being reduced only along certain directions, while they are 
diverging in other directions. In this case the resulting distance even increases with 
time. An originally localized region in phase space is so to speak “rolled out” and 
widely distributed by the dynamic flow. The shrinking of the volume towards zero 
then means that an originally A-dimensional hypercube in phase space changes over 
to a geometric object with lower dimension D < N. D may even take a nonintegral 
value, as will be explained in Chap. 26. 


23.2 Attractors 

The dynamics of a nonlinear system may be highly complicated. It is convenient to 
distinguish between transient and asymptotic behavior. A transient process denotes 
the initial behavior of a system after starting from a given point xo in phase space. 
Naturally, it is particularly difficult to make general statements, since the transients 
depend on the particular initial condition. Theorists therefore tend to ignore this part 
of the trajectory, even if it may play an important role in practice, depending on the 
dominant time scales. Only recently has the study of transients gotten more attention. 

The systematic treatment of the asymptotic or stationary behavior of a system is 
somewhat simpler. “Stationary” shall not mean here that the system is at rest but only 
that possible transient phenomena have faded away. In dissipative systems, which will 
be treated here, the trajectories will asymptotically approach a subset of the phase 
space of lower dimension, a so-called attractor. 

The definition and correct mathematical classification of attractors is not quite sim¬ 
ple. Actually there are several concepts in literature that differ from each other in de¬ 
tail. Here we first give a mathematical definition 2 but shall also illustrate the concept 
of the attractor by various examples in the subsequent chapters. 

Let us consider a vector field F(x) on a space M (e.g., M = R N ) with an associated 
phase flow . A subset AcMis denoted as an attractor if it fulfills the following 
criteria: 

(1) A is compact. 

(2) A is invariant under the phase flow 

(3) A has an open environment U that contracts to A under the flow. 

This statement needs several explanations: 

(1) A set is called compact if it is closed and restricted. This means that any limit 
value of an infinite sequence belongs itself to the set, and the set cannot extend up 
to infinity. “Exploding” solutions where for example particles escape to infinity 
therefore cannot be attractors. 

(2) Invariance under the phase flow means that 

cD,(A) = A for all t. (23.19) 

Hence, a point on the attractor never may leave this attractor. 


2 F. Scheck, Mechanik, Springer (1992). This book is also available in English: F. Scheck, Mechan¬ 
ics: From Newton’s Laws to Deterministic Chaos, 3rd edition, Springer (1999). 
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Fig. 23.1 . Visualization of the 
definition of an attractor A 
{hatched). In the course of 
time evolution, the environ¬ 
ment U is shrinking such that 
for t > ty it is enclosed in 
any arbitrary smaller environ¬ 
ment V 



(3) This may be formulated in two steps. First, the environment UD A is larger than 
the attractor itself, since we are dealing with an open range that includes the com¬ 
pact A. U shall be positively invariant , i.e., 

0 ? ((/)c[/ for all f >0. (23.20) 

If a point once lies within U, then it cannot leave it. It will, on the contrary, even be 
pulled toward A, which may be formulated as follows: For any open environment 
V of A that lies completely within U, i.e., A c V C U , one can find a time ty 
after passing that the image of U lies entirely within V : 

<&*(£/) C V for all t > ty. (23.21) 

Since V may be chosen arbitrarily “close” about A, this means that for large time 
values U is shrinking toward the attractor A. 

Frequently, the definition of an attractor is still extended by the requirement 
that it shall consist of one piece only. 

(4) A cannot be separated into several closed nonoverlapping invariant subsets. 

An important property of an attractor is its domain of attraction. The maximum 
environment U that contracts to A is called the basin of attraction B. In correct math¬ 
ematical formulation, B is the union of all open environments of A that fulfill the 
conditions (23.20) and (23.21). 

The introduction of the concept of attractor given here is rather complex. This is 
justified, however, by the fact that attractors may have very complex properties. Of 
central importance for nonlinear dynamics are the concepts of strange and chaotic 
attractors, which sometimes—not quite correctly—are used as synonyms. These con¬ 
cepts will become fully transparent only in the subsequent chapters and by examples. 
But we shall present the definitions now: 

Chaotic attractor: The motion is extremely sensitive with respect to the initial con¬ 
ditions. The distance between two initially closely neighboring trajectories increases 
exponentially with time. For more details see Chap. 26. 

Strange attractor: The attractor has a strongly rugged geometrical shape that is 
described by a fractal. For more details see Chap. 26. 

Both of these properties arise, as a rule, in common. There are, however, also ex¬ 
amples 3 where an attractor is chaotic but not strange, or is strange but not chaotic. 


C. Grebogi, E. Ott, S. Pelikan and J.A. Yorke, Physica 13D, 261 (1984). 


3 
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23.3 Equilibrium Solutions 


A particularly simple case arises if the system is in stationary equilibrium, i.e., 

F(xo) = 0 such that x(t) = xo = constant. (23.22) 

Such an xo is also called a critical point or fixed point. Of particular interest is the 
question of whether or not the system is moving toward such a fixed point and—if 
several ones exist—to which of them. A fixed point that attracts the trajectories is the 
simplest example of an attractor. In this case the set A defined in the previous section 
is trivial and consists of a single point. 

We are therefore interested in the stability of equilibrium solutions. To this end we 
consider the trajectories x(t) in the vicinity of a critical point xo. We thus require that 
the distance 


HO =x(t) -x 0 


(23.23) 


be a small quantity. Under this condition, the problem may be greatly simplified, since 
it usually suffices to take only the lowest term of the Taylor expansion of F (x) into 
account. The linearized equation of motion then reads 


dt 


HO = MHO, 


(23.24) 


where terms of quadratic or higher order in § were neglected. M denotes the Jacobi 
matrix (functional matrix) of the function F(x) evaluated at the position xo. This ma¬ 
trix has the elements 


dFi 

M ik = -r 1 
dx k 


(23.25) 


xo 


Contrary to the original nonlinear equation of motion (23.1), the solution of the lin¬ 
earized problem (23.24) is in principle simple; it may be given analytically. Let us 
first consider the trivial special case of a one-dimensional system (N = 1). The Jacobi 
matrix then has only a single element, say, /z, and (23.24) is solved by 


m=e^m- 


(23.26) 


The character of the solution is determined by the sign of /z: For /z < 0, xo 
is a stable equilibrium point, since small perturbations decay exponentially. For 
/z > 0, the equilibrium is unstable, since even the smallest displacements from 
the equilibrium position “explode” exponentially. For /z = 0, the limit case of in¬ 
different or neutral equilibrium arises. The behavior of the system under pertur¬ 
bations is then determined by the higher derivatives of the function F(x) at the 
point xo. 

The general case (N > 1) may be treated as in Chap. 8 by the method of normal 
vibrations. One then constructs solution vectors u (t) normalized to 1, all components 
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of which show the same (exponential) time dependence: 

u (t) = e^u. (23.27) 

Using (23.24), one arrives at the eigenvalue problem 

Mu = fin. (23.28) 

This N -dimensional linear system of equations only has nontrivial solutions if the 
determinant 


det(M|-y — /JiSij) = 0 (23.29) 

vanishes. This characteristic equation (secular equation) has as a polynomial of Nth 
order in general N eigenvalues pi n with the associated eigenvectors u n . The general 
solution of (23.24) may then be written as a superposition 

N 

m = J2 c " e * ,t *n, (23.30) 

n =1 

where the expansion coefficients c n may be determined from the initial condition at 
t = 0. The eigenvalues (i n may be real or complex. Complex eigenvalues thereby arise 
always pairwise : If ti n solves (23.29), then the complex-conjugate /x* obviously also 
solves the equation, since the Jacobi matrix M[j is real. 

The real parts of the eigenvalues of the characteristic equation are decisive for char¬ 
acterizing an equilibrium point xo. We now define a tightened form of the condition 
of stability: An equilibrium point xo with F(xo) = 0 is called asymptotically stable if 
there exists an environment U 3 x o within which all trajectories are running toward 
xo for large times: 

lim x(t) = xo for x(0) e U. (23.31) 

t^oo 

If the function (the vector field) F is sufficiently smooth so that it can be described 
by the linear approximation, one may immediately give a sufficient condition for as¬ 
ymptotic stability : The point xo is asymptotically stable if all eigenvalues of the Jacobi 
matrix have a negative real part, i.e., if 

^ P'n < c < 0 for all n = 1,..., N (23.32) 

with a positive constant c. 

A glance at (23.30) shows that under this condition all contributions to the dis¬ 
placement £(f) exponentially tend to zero, such that asymptotically 

|| x(t) — xo|| < constant • e~^ mmn . (23.33) 

Conversely, if at least one of the eigenvalues has a positive real part, 9i/x n > 0, then 
xo is an unstable fixed point, since displacements along are increasing exponen¬ 
tially. 

With the knowledge of the eigenvectors u n , the total phase space may be spanned in 
partial spaces. The stable (or unstable) partial space is spanned by all vectors satis¬ 
fying < 0 (or > 0). In addition, a partial space may occur with the special value 
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$ip n = 0. If this happens, one speaks of a degenerate fixed point. (The associated par¬ 
tial space is also called the center ; but we shall not deal here in more detail with the 
related problems.) If one considers a general perturbation of a trajectory, it will have 
components in all partial spaces. After a sufficiently long time the contribution with 
the maximum ?Rp n will dominate. 

Finally, we note that the linear stability analysis holds only in the vicinity of a 
critical point xo. It may be shown mathematically that the topological behavior of the 
flow does not change there under the influence of the nonlinearity. But this vicinity 
may be very small, such that one cannot make a statement about the global behavior 
of the flow by this way. 


EXAMPLE 


23.1 Linear Stability in Two Dimensions 


The stability analysis becomes particularly transparent for the case N = 2 that corre¬ 
sponds to a dynamic system with one degree of freedom x\ = q and the associated 
momentum X 2 = p. In the vicinity of a fixed point x = F(xo) = 0, the motion is deter¬ 
mined in a linear approximation by the four elements of the Jacobi matrix M;y. The 
characteristic equation (23.29) 


M ii — p M] 2 

M21 M22 — P 


(23.34) 


is a quadratic polynomial 

p 2 — (Mu + M 22 )p + Mi 1 M 22 — M 12 M 21 = 0 (23.35) 


or 


p 2 — 2s p + d = 0 


(23.36) 


with 

1 1 

s = — (Mu + M 22 ) = —Tr M, d = M\ 1 M 22 — M 12 M 21 = detM. (23.37) 

The two solutions of (23.36) may be given explicitly: 

At 1/2 = s±y/s 2 -d. (23.38) 

Depending on the magnitude and sign of the two constants s and d, there are many 
distinct possibilities for the eigenvalues pi, P 2 - 

(a) pi , P 2 real and both negative (if ^ < 0 and 0 < d < s 2 ) stable node 

(b) pi , P 2 real and both positive (if ^ > 0 and 0 < d < s 2 ) unstable node 

(c) pi , P 2 real with distinct signs (if d < 0) saddle 

(d) pi = p1^, negative real part (if ^ < 0 and d > s 2 ) stable spiral 

(e) pi = p* v positive real part (if s > 0 and d > s 2 ) unstable spiral 

(f) pi = p\, purely imaginary (if s = 0 and d > 0) rotor 

The ranges are represented in Fig. 23.2 in the s, d-plane. To these alternatives, there 
correspond distinct types of trajectories £ (t) = x(t) — xq according to (23.30). 
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Fig. 23.2. Ranges of distinct 
stability depending on the pa¬ 
rameters 5 and d 



Fig. 23.3. Various types of 
stability of a fixed point in two 
dimensions. Upper row\ Sta¬ 
ble and unstable node, saddle. 
Lower row. Stable and unsta¬ 
ble spiral, rotor 





Figure 23.3 illustrates how the trajectories in the vicinity of a stable node are run¬ 
ning into the fixed point: 

§(f) = cie“ l/il|r ui +c 2 e H/i2l 'u 2 , (23.39) 

where ui and U 2 are the (not necessarily orthogonal) eigenvectors. The curvature of 
the trajectories arises if fi\ / These curves are parabola-like, with a common tan¬ 
gent at the origin (in ui - or U 2 -direction depending on whether [i 2 or ii\ is larger). The 
trajectories for the unstable node , Fig. 23.3(b), have the same shape but are passed in 
the opposite direction (exponential “explosion”). For the case of a saddle the trajec¬ 
tories are running in the ui-direction (let < fi 2 without restriction of generality) 
toward the fixed point but are pushed off in the U 2 -direction, which results in the 
hyperbola-like trajectories of Fig. 23.3(c). 

If the eigenvalues are complex , 

li\=lL r + im, iL 2 = lL r -im, (23.40) 

this will hold, because of (23.28), for the eigenvectors too: 


ui =u r +/Uj, 


U 2 = U, - Ml;. 


(23.41) 
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The general solution (23.30) then has the form 


Example 23.1 


§ (t) = c\ e^ui + C2 e^ 2t U2 
= c\ +c\ e^u* 

= 2e /Xrt m(c 1 e^ it u 1 ), 


(23.42) 


(23.43) 


where C 2 = c\ to get § real. If the constant c i, the value of which is fixed by the 
initial condition § (0), is split into magnitude and phase, and the same is done for the 
Cartesian components of the complex eigenvector ui, 


a=pe l<t) , ui = ae l0l e x -\-be l ^e y , with a 2 + b 2 = 1, 


(23.44) 


then (23.42) may be rewritten as follows: 

§(0 = 2 p e^ rt [acos(/jLit + 0 + aOe* + frcos(/z*T + 0 + /3)e y ]. 


(23.45) 


The factor in brackets describes harmonic vibrations shifted in phase relative to each 
other (if a ^ j3). One thus has the parametric representation of an ellipse. Due to the 
prefactor, the size of the ellipse varies exponentially with time. Thus, the trajectories 
are logarithmic spirals moving toward the fixed point or away from it, depending on 
the sign of the real part of /x; see Fig. 23.3(d), (e)—hence, the name spiral. The case 
of the rotor with DT/x = 0 plays a particular role, since the trajectories in the vicinity of 
xo are periodic functions (concentric ellipses). This means that the equilibrium point 
is stable (small displacements are not amplified) but not asymptotically stable (the 
trajectory does not run into the fixed point), and hence this point is not an attractor. 

EXERCISE 

23.2 The Nonlinear Oscillator with Friction 

Problem. Let a one-dimensional system be described by the following equation of 
motion: 


x + ax + fix + yx 3 = 0. 


(23.46) 


Show that the system is dissipative. Interpret the individual terms and discuss the 
possible fixed points and their stability. 

Solution. We are dealing with a harmonic oscillator involving friction and nonlinear¬ 
ity. Besides the linear backdriving force of the harmonic oscillator (third term), there 
acts a friction force proportional to the velocity (second term). Moreover, a cubic non¬ 
linearity (fourth term) becomes important. This force law corresponds to a potential 



(23.47) 


where m denotes the mass. We obtain various types of motion, depending on the 
magnitude and sign of the constants in (23.46). We first rewrite the equation of mo¬ 
tion (23.46) in the standard form. For this purpose, we introduce the velocity as 
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Exercise 23.2 


a second coordinate, x = (x, y) = (x, i), which leads to the coupled differential equa¬ 
tions of first order: 


x 


-(*) = ( y 
dt\y) \-ay-fix — yx 3 


= F (x). 


(23.48) 


For a > 0, the system is dissipative, since the divergence of the velocity field is 
9 9 

A = V • F = —y H- (-ay -fix- yx 3 ) = -a < 0. (23.49) 

dx dy 

The equilibrium condition F(xo) reads 

y = 0, x(fi + yx 2 ) = 0 . (23.50) 


Hence, besides the equilibrium position xo = (0, 0) without displacement, there still 
occur two further symmetrically positioned fixed points xo = (± y/—fi/y, 0 ), provided 
that the constants fi and y have distinct signs. Figure 23.4 shows the associated po¬ 
tential functions V(x) for all combinations of signs. 


Fig. 23.4. Potentials of the 
quadratic oscillator for vari¬ 
ous signs of the parameters fi 
and y 



To discuss the linear stability, we need the Jacobi matrix 

(3F x /3x 3Fi/3y\ f 0 1 \ 

\dF 2 /dx dF 2 /dy) \-P-3yx 2 -a) ' 


(23.51) 


The characteristic equation (23.36) in Example 23.1 for the eigenvalues involves the 
following coefficients: 


Forxo = (0,0): s = --a, d = fi, 

Forxo = (0, 3z^/—fi/y): s = --a, d = —2/3. 

Obviously, asymptotic stability may occur only for a positive sign of the constant 
a > 0. Only then is one dealing physically with a damping friction term. For the fixed 
point in the rest position vo = (0, 0), we get the alternatives 

(1 ) fi > \ol 2 stable spiral 

(2) 0 < fi <\a 2 stable node 

(3) fi < 0 saddle 

In the first case, there arise weakly damped vibrations, in the second case the oscillator 
is overdamped, and the displacement monotonically tends to zero. For fi < 0, the equi¬ 
librium position is unstable, as may be seen from the potential plots in Fig. 23.4(c), (d). 
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The analogous considerations for the fixed points xo = (i^/— f/y, 0) lead to (assum¬ 
ing a > 0 ): 

(1) — 2/3 > ^ a 2 stable spiral 

(2) 0 < — 2/3 < \a 2 stable node 

(3) f > 0 saddle 

The factor 2 arises because the curvature of the potential (23.47) in the equilibrium 
positions with finite displacement is twice as large as in the rest position. Only the 
double-oscillator potential (Fig. 23.4(c)) allows stable displaced fixed points (ft < 0 
and y > 0 ). 

It is instructive to plot the position of the fixed points as a function of the pa¬ 
rameter (3. As is seen from Fig. 23.5, for = 0 there occurs a square-root branch¬ 
ing. For y > 0, a stable equilibrium position bifurcates into two new stable solutions. 
Such bifurcations (Lat .furca = fork) frequently occur in nonlinear systems; see also 
Chap. 25. 



23.4 Limit Cycles 

Besides the simple stationary equilibrium points studied in detail in the section on 
attractors, a dynamic system may exhibit still other types of stable solutions. These 
are the so-called limit cycles that are characterized by periodically oscillating closed 
trajectories. Similar to the fixed points discussed already, limit cycles may also act as 
attractors of motion; compare the section on attractors. Then there exists a more or less 
extended range in phase space (the “basin of attraction” of the attractor): trajectories 
starting from there move toward the limit cycle, which is approached for t —► oo. For 
limit cycles, one may also perform a mathematical stability analysis as for fixed points 
which by its very nature is somewhat more difficult. 

We shall concentrate ourselves here to a special but typical example, namely a har¬ 
monic oscillator with a nonlinear friction term. The associated differential equation 
has the general form 

d 2 x dx 9 

—~y + f(x)—+ co 2 x = 0. (23.52) 

dt z dt 

If the middle term is absent, we obtain a harmonic oscillator with angular frequency co. 
The case of a constant coefficient, f(x)=a = constant, leads to a linear differential 
equation that may be solved easily. The character of the solution is determined by an 
exponential factor exp (—at/ 2). The solution thus decreases exponentially toward the 
fixed point at x = x = 0 if a is positive. A negative value of a means that a force is 
acting along the same direction as that of the instantaneous velocity which leads to an 
unlimited amplification of the solution (negative damping). Physically one of course 


Exercise 23.2 


Fig. 23.5. Position of the sta¬ 
ble (continuous) and unstable 
(dotted) fixed points depend¬ 
ing on the parameters ft and y 
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no longer deals with a friction force; rather an external source must exist that pumps 
energy into the system. 

If allowance is made for more general functions /(x), it may happen that the damp¬ 
ing coefficient takes partly positive, partly negative values, depending on the displace¬ 
ment. Of particular interest is the case when /(x) is negative for small magnitudes of 
x, and positive for large displacements. The simplest ansatz providing such a behavior 
is a quadratic polynomial 

f(x)=a(x 2 -Xq), (23.53) 


where a determines the strength of the damping/excitation and two zeros are at x = 
±xo. The zeros may be set to the value 1 without loss of generality by rescaling the 
variables to x' = x/xo with a' = «Xq . For convenience one may also choose the value 
1 for the frequency by rescaling the time: t' = coot with a" = a'loo. The standard form 
of the equation of motion then reads (dropping the primes again): 

d 2 x 9 dx 

-pr +o/(x 2 — 1)— + x =0. (23.54) 

dt z dt 

This differential equation has been set up and discussed in 1926 by the Dutch engi¬ 
neer B. van der Pol. It served first for describing an electronic oscillator circuit with 
feedback (at that time still with valves), but it was already clear to the author that his 
equation could be applied to a variety of vibrational processes. Actually the origin 
of this equation may be traced back even further, since around 1880 Lord Rayleigh 
investigated the following differential equation in the context of nonlinear vibrations: 


d 2 v 1 / dv \ 3 
~dt I+ °‘ 3 \dt) 


dv 

dt 


+ v = 0 . 


(23.55) 


One easily sees the relation between (23.54) and (23.55). We have only to differentiate 
the Rayleigh equation (23.55) with respect to time and then substitute 


dv 

dt 


= x 


(23.56) 


to get the van der Pol equation (23.54). Thus, both equations are essentially equivalent 
to each other. 

We now discuss the solutions of the van der Pol equation (23.54). It may be trans¬ 
formed as usual to the standard form (23.1) of two coupled differential equations of 
first order for the vector x(t) = (x, y) T : 


dx 

dt 


= y, 


— = —x — a(x 2 — l)y. 
dt 


(23.57) 

(23.58) 


It is now advantageous to transform to polar coordinates in the x, y-phase space: 


x = rcos0, y = r sin 6. 


(23.59) 


The time derivatives of r and 6 may be expressed by those of x and y. For the radius 
coordinate the relation follows immediately from the differentiation of r 2 = x 2 + y 2 : 

dr dx dy 

r — =x ~r + y~r- 

dt dt dt 


(23.60) 
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An analogous relation for the angle coordinate may be obtained from the time deriva¬ 
tives of (23.59): 


dx 

dt 


dr dO . 

— cos 6 — r — sin 0, 
dt dt 


dy 

dt 


dr . dO 
— sin 0 + r — cos 0 . 
dt dt 


(23.61) 

(23.62) 


By multiplying the first of these equations by y and the second one by x and subtract¬ 
ing both equations, one obtains 


r 


2 


dO dy 
dt dt 



(23.63) 


Using (23.60) and (23.63), the van der Pol system of equations in polar coordinates 
reads as follows: 


ar 2 2 2 

— = —a(r cos 0 — l)rsin 0 , 
dt 


au 9 2 

— = — 1 — a(r cos 0 — 1)sin#cos0. 

dt 


(23.64) 

(23.65) 


The nonlinear terms on the right-hand side have a rather complex shape, but one may 
give some qualitative statements on the solutions to be expected. In the limit a = 0, 
one has of course a normal harmonic oscillator. The trajectories in phase space are 
circles which are traveled through uniformly with the frequency 1, such that 


x(t) = p sin(f — to) 


(23.66) 


with arbitrary p and to. Due to the nonlinearity in (23.64) and (23.65), the behavior of 
the solution is modified. As long as a 1, the influence on the revolution frequency 
remains small: Since the function sin 0 cos 0 changes its sign twice in each period, 
the influence of the nonlinear term in (23.65) cancels out on the average. It is quite 
different, however, for the radial motion: Here sin 2 0 is positive definite, and small 
changes of the radius may accumulate from period to period. The evolution direction 
of the effect is determined by the sign of — a(r 2 cos 2 0 — 1). In the following we 
shall discuss the (more interesting) case a > 0 (the set of solutions for a < 0 may be 
obtained by inversion of the time coordinate t -> —t). 

For small displacements r < 1, the factor —ot(r 2 cos 2 0 — 1) is then always positive , 
and the radius increases slowly but monotonically. For large displacements r 1, 
on the contrary, the factor is predominantly negative (except for the vicinity of the 
zeros of cos 0), and the mean radius decreases from cycle to cycle. A more detailed 
investigation as is performed in Exercise 23.3 shows that the trajectory in the course of 
time approaches a periodic one, independent of the initial conditions, which for given 
a is uniquely determined. This is the limit cycle of the system. 

As long as a is very small, the limit cycle resembles a harmonic vibration as in 
(23.66). The crucial difference is, however, that the amplitude p now has a sharply 
determined value, namely, p = 2. If one starts from a smaller or larger value, the tra¬ 
jectory is a spiral approaching the limit cycle. The result of a numeric calculation 
for the value a = 0.1 is represented in Fig. 23.6. One can follow the spiraling mo¬ 
tion towards the limit cycle. Moreover, deviations from the purely harmonic vibration 
become visible. 
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Fig. 23.6. (a) The solutions 
of the system of differential 
equations (23.57) in the x, y- 
plane approach the limit cy¬ 
cle (bold curve), a slightly 
deformed circle, independent 
of the initial condition. The 
nonlinearity parameter is a = 
0.1. (b) The solutions x(t) 
of the van der Pol oscillator 
after a transient motion co¬ 
incide with an approximately 
harmonic vibration 



x 



Even more interesting is the solution in the opposite limit a 1, in which the non¬ 
linearity plays a dominant role. Here also a limit cycle evolves for the same reasons, 
the shape of which, however, strongly differs from a harmonic vibration. Figure 23.7 
shows the phase-space plot and the trend of the amplitude of the limit cycle for the 
case a = 10. One notices that the displacement remains in the range of the maximum 
amplitude x = 2 and slowly decreases toward x = l. Subsequently, a sudden “flip- 
over” sets in, and the displacement drops to the value x = —2. Then the game repeats 
with opposite sign. The period length of this kind of vibration is no longer determined 
by the oscillator frequency (here co = 1) but takes a much larger value. An analytic 
investigation (see Exercise 23.4) shows that it increases proportional to the “friction” 
parameter a : 


T~(3 —21n2)a. 


(23.67) 
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Fig. 23.7. Solutions of the 
van der Pol oscillator with 
strong nonlinearity, a = 10. 
The meaning of the curves is 
the same as in Fig. 23.6 



A motion performed by a van der Pol- or Rayleigh oscillator for large a is also 
called relaxation vibration. The name indicates that a tension builds up slowly which 
then equilibrates via a sudden relaxation process. Such relaxation vibrations fre¬ 
quently occur in nature. For example, the vibration of a string excited by a bow, the 
squeak of a brake, and even the rhythm of a heartbeat or the time variation of animal 
populations may be classified in this way. 

An important and also practically useful property of nonlinear oscillators with a 
limit cycle lies in the fact that self-exciting vibrations occur that are well defined and 
independent of the initial conditions. As a somewhat nostalgic example, we quote the 
balance of a mechanical clock, the vibrations of which are largely independent of the 
strength of the driving force. Finally, we quote without proof a mathematical theorem 4 
stating that the possible types of motion of a two-dimensional system (corresponding 


4 See, e.g., J. Guckenheimer and P. Holmes, Nonlinear Oscillations, Dynamical Systems and Bifur¬ 
cations of Vector Fields, Springer (1983). 
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to a mechanical system with one degree of freedom: one coordinate plus one velocity) 
are completely governed by fixed points and limit cycles. 

The theorem of Poincare and Bendixson: Let a two-dimensional dynamic sys¬ 
tem x(t) = ( x(t ), y(t)) T be described by the differential equation 


with a continuous function F. Let B be a closed and restricted range of the x,y-plane. 
If a trajectory lies in B for any time t > 0, x(t) e B , there are three possibilities: 

(i) x(t) is a periodic function, 

(ii) x(t) asymptotically approaches a stationary equilibrium point, and 

(iii) x(t) asymptotically approaches a periodic function (limit cycle). 

The general theorem says of course nothing about the number and shape of the 
fixed points and limit cycles. However, it excludes the existence of more complicated 
nonperiodic types of solutions! It is important that the statement holds only for two- 
dimensional systems. Two trajectories are not allowed to intersect each other in phase 
space, which in the two-dimensional plane leads to considerable restrictions. But in 
more than two dimensions the trajectories may “evade” each other, and more com¬ 
plex patterns of motion are possible. In this case, the already-mentioned strange at¬ 
tractors with a complicated shape may also occur. This will be treated in the next 
chapters. 


EXERCISE 

23.3 The van der Pol Oscillator with Weak Nonlinearity 

Problem. Show that the solutions of the van der Pol oscillator for small values of a 
are spirals that approach a circle (the limit cycle) with the radius 2. 

Hint: It is a good idea to introduce new variables that are averaged over one oscil¬ 
lation period. 

Solution. We start from the plausible assumption that for a, 1 the solution of the 
system of differential equations (23.64) and (23.65) differs only slightly from that of 
the harmonic oscillator if it is considered for short time intervals. In order to calculate 
a long-term drift of the variables, it is efficient to average over one vibrational period 
in each case. We define the averaged amplitude r(t) as 



(23.68) 


The integration thereby extends over a full revolution of the angle, i.e., from 0 to 
0 — 2tt (the minus sign arises because of d6/dt 1). The corresponding time inter¬ 
val runs from t to approximately (for a = 0 exactly) the value t + 2n. 
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We are interested in the time variation of the averaged amplitude for which accord¬ 
ing to (23.64) we have 


— = —a — (& dO r sin 2 0(r 2 cos 2 0 — 1) 
dt 2tt J 


= —a 


2tt 

f dO / 2^ . 2 on • 2/)\ 

1 — rlr - sin 20 — sin 0 . 
J 2n V 4 / 


0 


(23.69) 


For small a , the quantity r(t ) considered over a period varies only slowly, and hence 
may be pulled out of the integral and replaced by r (t). The remaining angle integration 
is trivial, since the mean value of both sin 2 0 and sin 2 20 just equals 1 /2. Hence, the 
averaged amplitude satisfies the differential equation 


dr 

dt 



(23.70) 


which is correct up to the order 0(a 2 ). The circulation frequency, on the contrary, 
does not change to first order: 


dO 

dt 


2n 

/ dO 

— \— 1 — a(r 2 cos 2 0 — 1) sin# cos$] = — 1. 
0 


(23.71) 


The angular integral vanishes here, since the integrand is an odd function with respect 
to 0 = 7T. The differential equation (23.70) for the averaged amplitude may be solved 
in closed form. We write 


dp - u -3 
— =ar — or 

dt 


and transform to the new variable 


1 dr 

u = — such that du = -2—^-. 


(23.72) 


(23.73) 


Obviously, (23.72) reduces to the simple linear differential equation 


1 du 

-= au 


2 dt 


-b , 


(23.74) 


the solution of which is a shifted exponential function: 

u(t) = —h ce~ 2at , 
a 


(23.75) 


where the free constant c is to be determined from the initial condition: c = u(0) — 
b/a. Insertion of ^ = Ck:/2 and b = a/ 8 finally yields 

2r(0) 

r(t) = . 

^r 2 (0) + (4 — r 2 (0))e~ at 


Exercise 23.3 


(23.76) 
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Exercise 23.3 Thus, it is proved that the trajectories are spirals which approach a circle with radius 2 

from inside (r(0) < 2) or outside (r(0) > 2). This is the limit cycle of the van der Pol 
oscillator for small values of a . 


EXERCISE 


23.4 Relaxation Vibrations 


Problem. Discuss the solutions of the Rayleigh oscillator (23.55) qualitatively for 
large values of the parameter a 1. Find an approximate solution for the period 
length of the resulting relaxation vibration. 

Solution. The differential equation (23.55) of the Rayleigh oscillator written in stan¬ 
dard form reads 


dv 



In order to discuss the behavior of the solution for large values of a, it is convenient 
to rescale the amplitude to a new variable z = v/a : 


dz 

dt 

dx 

dt 


1 

= —x, 
a 

= -a[z + f(x)\, 


(23.78) 


with the abbreviation f(x) = (v 3 /3) — x. From this quantity one may read off the 
direction of the trajectory for any point of the z,x- plane: 


dx dx/dt 2 £ + /(•*) 
dz dz/dt x 


(23.79) 


This means that for a ^ 1, the trajectories are almost vertical. Other directions 
may occur only near the curve z(x) = —f(x). This cubic limit curve subdivides the 
z, x-plane into two halves (see Fig. 23.8). In the right half, the derivative dx/dt i s neg¬ 
ative, according to (23.78), and the trajectories are running (almost) vertically down¬ 
ward. In the left half, they are running upward. 


Fig. 23.8. The cubic limit 
curve z(x) determines the as¬ 
ymptotic behavior of the re¬ 
laxation vibrations of the van 
der Pol oscillator 
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From this knowledge, the motion for large a may be constructed graphically. Be¬ 
ginning from an arbitrary initial point, e.g., the point O in the figure, the trajectory 
at first falls almost vertically down to the curve z(x) = — f(x). The further motion 
proceeds with significantly lower velocity near this curve (directly on the curve the 
velocity would vanish, dx/dt = 0). Finally, the point of inversion B is reached at 
(z,*) = (2/3,1). 

Because dz/dt > 0, the trajectory cannot follow the backward-running branch of 
the curve but “falls down” to the point C at (2/3, —2). Now the game is repeating 
with inverse sign. The curve ABCD forms the limit cycle of the Rayleigh oscillator. 
It consists of two slowly passed parts (x = 2.. A and —2... —1) and two fast jumps 
(x = 1... — 2 and —1... 2). This discussion immediately applies, of course, to the van 
der Pol oscillator, since according to (23.56) its displacement just corresponds to the 
velocity v of the Rayleigh oscillator introduced in (23.77). 

The period length T of the relaxation vibration may be evaluated easily: 

T = £dt = a£^-, (23.80) 


where the integral extends over a full period. Since the motion along the partial 
branches BC and DA proceeds very quickly, it is sufficient to calculate the contri¬ 
bution of A B : 


T - a I ~x +a f 

AB CD 


dz 

X 


= 2a 


/ 


AB 


1 



dx 


2 


dz/dx 

x 


(23.81) 


The derivative dz/dx is to be formed on the curve AB , i.e., dz/dx = — df/dx : 
1 2 


T 22 2a I dx ——— =2 a I dx 


!■ 


'/■ 


x 2 -l 


. 1 2 

= 2al -x — In v 


2 l 

= (3 -21n2)a~ 1.614a. 


(23.82) 


The period length calculated numerically in Fig. 23.7b for a = 10 amounts to about 
19, hence the asymptotic range is not yet fully reached. 


Exercise 23.4 









In Chap. 23, we have considered the stability of an equilibrium point xo by investigat¬ 
ing the behavior of the trajectories in the vicinity of this point. Now we are interested 
in the environment of a time-dependent reference trajectory x r (t). The former case of 
a stationary fixed point x r (t) =x o = constant is of course included too. One may again 
distinguish between various kinds of stability. Stability in the sense of Lyapunov exists 
if a point on a neighboring trajectory x(t) remains close to x r (t) for all times. The 
formal expression of this concept reads as follows: A path x r (t) is called Lyapunov- 
stable if for any € > 0 a value 5(6) > 0 can be found such that any solution with 
|x(fa) — x r (fa)| < 5 satisfies the condition |x(0 —x r (t )| <6 for all times t > to. 

Figure 24.1(a) shows that the “perturbed” paths are confined within an (N + 1)- 
dimensional tube of radius 6 about the reference trajectory. This does not yet mean 
that the trajectories are approaching each other with time. If the latter happens, one 
speaks as before (see (23.31)) of asymptotic stability; see Fig. 24.1(b). 


x(t) 



(b) 


(a) 


Fig. 24.1 . (a) The neighboring paths x(t) of a Lyapunov-stable path x r ( t ) remain in its vicinity, 
(b) In the case of asymptotic stability, neighboring paths are attracted such that the distance 
decreases to zero with increasing time 

A path x r (t) is called asymptotically stable if it is Lyapunov-stable and if for the 
neighbor trajectories lim^oo |x(0 — x r (f)| = 0. 

It is of interest that for time-dependent trajectories x r (t) there exists still another 
concept of stability that was not needed for stationary fixed points xo. It may happen 
that although the shape of two paths in phase space is very similar, these paths are nev¬ 
ertheless passed with distinct speeds; see Fig. 24.2. Thereby a time shift may evolve, 
| x(t) — x r (t) | increases, and the definitions of stability given so far do not apply. One 
therefore introduces a weakened version: A path x r (t) is called orbitally stable if for 
any 6 > 0 a value 8 (e) > 0 can be found such that any solution with |x(fa) — x r (fa) I < 8 
is confined for all times t > to within a tube of radius 6 about the path x r (t). 


Fig. 24.2. Example of a path 
that is asymptotically sta¬ 
ble but not orbitally stable. 
Neighboring paths are passed 
with distinct speeds 
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Using this definition, we consider the geometric position x(t). The time as a para¬ 
meter of this curve, on the contrary, does not play a role. 


24.1 Periodic Solutions 


The investigation of the stability of time-dependent solutions is by its very nature 
more difficult than in the case of stationary fixed points. Here we shall treat the im¬ 
portant special case of periodic solutions. Then we may apply a formalism that goes 
back to the French mathematician Floquet (1883). Thus, we assume that the reference 
trajectory x r (t) repeats itself after a period length T, 

x r (t + T) = x r (t). (24.1) 


This may originate in two distinct manners. First, in an autonomous system vibrations 
may arise by themselves, e.g., in the harmonic oscillator. Here the right-hand side 
of the equation of motion does not depend explicitly on the time, x = F(x). On the 
other hand, there are also periodically externally excited systems which are under the 
action of a time-dependent external drive with the periodicity ¥(x,t T) =F(x,t) 
that reflects itself in the trajectory. An advantage is here that the period length T is 
imposed from outside, while the vibrational frequency of an autonomous system is 
not known from the beginning and must be determined—except for simple special 
cases—by numerical solution of the equation of motion. 

To discuss the stability of x r (t), one investigates, as in (23.23), the neighboring 


trajectories 

x(f)=x r (0 + $(0, (24.2) 

where the deviation § (t) is assumed to be small. From the equations of motion 

x(f)=F(x, t) and x r (t) = ¥(x r , t), (24.3) 

we find 

+ k = F(x r + §, t) = F(x r , t ) + §, (24.4) 

or 

k = F(x r +5,0- F(x r , 0 = G($, t). (24.5) 

This equation can be linearized by expanding the right side in a Taylor series and 
neglecting higher terms: 

5(0=M(0$(0 (24.6) 


with the Jacobi matrix (here written in abstract without giving the indices) 


M(0 = 



3F 


3x 


x r (0 


(24.7) 


Equation (24.6) is, like (23.24), a linear system of differential equations, but now the 
matrix of coefficients is periodically time-dependent, M(t + T) = M(0, while for¬ 
merly it was constant. This periodicity also holds for autonomous systems: Although 
the function F(x) does not involve the time explicitly, the reference trajectory x r (t) by 
itself nevertheless induces a periodic time dependence. 
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24.2 Discretization and Poincare Cuts 

There exists a mathematical tool that is useful for the stability analysis of time- 
dependent paths but also in general for the qualitative understanding of dynamic sys¬ 
tems. The basic idea is to perform a discretization of the time dependence of a trajec¬ 
tory. This may be done in two somewhat different ways. 

An obvious possibility is the stroboscopic mapping. Instead of the continuous func¬ 
tion x(t), one considers a discrete sequence of “snapshots” x n = x(t n ), n = 0, 1, 2 ,.... 
The time points of support of the spectroscopic method are chosen as equidistant, 
hence t n = to + nT with a scanning interval T. Of course one should choose a value 
of T that is appropriate for the problem. If an oscillating driving force is acting, one 
will use its period length for T. The stroboscopic method becomes particularly simple 
if the trajectory x(t) itself is periodic and T coincides with the period length; then all 
x n are of course identical. The stroboscopic mapping of the path consists of a single 
point x n = xo in phase space. One should note that the position of the point xo depends 
of course on the selected reference time to and thereby may be shifted arbitrarily along 
the orbit. 

As was described in the preceding section, for a stability analysis one investi¬ 
gates neighboring trajectories x(t) that are in general no longer strictly periodic. The 
thin line in Fig. 24.3 shows such an example. The first three stroboscopic snapshots 
xo, xi, X 2 are marked by dots and the distance vectors % n = x n — x r o are plotted. 

An alternative method of discretization of trajectories, which is not oriented to the 
periodicity and is of advantage particularly for autonomous systems having no fixed 
eigenfrequency, is the Poincare cut. When changing over to the discretized sequence 
x n , one again chooses momentary snapshots of the continuous orbit x(t). As a criterion 
one now adopts not any fixed equidistant time distances, but rather a geometric prop¬ 
erty of the orbit itself, namely the piercing of a given hypersurface £. One thereby 
selects an (N — 1)-dimensional hypersurface in phase space and marks all points x n 
at which the trajectory intersects the hypersurface. One further requires that £ is not 
only touched but properly pierced. Mathematically this means that the surface shall be 
transverse to the dynamic flow, n(x) • F(x) ^ 0 everywhere on £, where n is the sur¬ 
face normal. One therefore speaks of a transverse cut. In a transverse cut one usually 
marks only points with a definite sign of F • n, i.e., only piercings of £ that proceed 
in the same direction. 

This method of discretization of trajectories was invented by Henri Poincare and is 
called the Poincare cut. Figure 24.4 shows as an example a trajectory in an (N = 3)- 
dimensional space, with the x,y-plane as the cut surface £. Three piercings in the 
negative z-direction are marked as xq, xi, X 2 . The use of Poincare cuts makes sense 



Fig. 24.3. The stroboscopic scan¬ 
ning of the distance §(0 = 
x(t) — x r (t) yields informa¬ 
tion on the stability of a path 


Fig. 24.4. The piercing points 
of a trajectory through a 
given surface constitute the 
Poincare cut 
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only then if the trajectory is moving largely or completely in a restricted range of 
the phase space, such that the cut surface is pierced again and again. Many systems 
with approximately periodic or also chaotic motion satisfy this condition. Examples 
of nontrivial Poincare cuts will be given in Chap. 27. 

An advantage of the Poincare cut is first of all the reduction of dimension of the 
phase space from N to N — 1, which may be very helpful for the qualitative discus¬ 
sion. For detailed studies, not only does one want to know the ensemble of points 
x w , but also their detailed sequence. Mathematically, this connection is mediated by a 
mapping 

P : x n -> x n+ \ with x n , x n+ \ e £. (24.8) 

This Poincare mapping thus connects every point of the sequence xo, xi, X 2 ,... with 
its successor. Note that P has no index. It is a single mapping of the plane £ onto itself, 
which according to (24.8) is “scanned” at individual points. The individual points of 
the Poincare cut arise by successive iteration of the Poincare mapping 

Xi = P(xo), x 2 = P(xi) = P 2 (\o), x n = P n (\o). (24.9) 

Hence, the long-term behavior of a trajectory may be derived from the properties of the 
iterated Poincare mapping P n ,n -> oo. If the time evolution of the dynamic system is 
determined by a differential equation x = F(x, t), the Poincare mapping is unique and 
also reversible (possibly except for singular points), since trajectories are not allowed 
to intersect each other. 

The problem of describing a dynamic system is of course not yet solved by defining 
the Poincare mapping but is only postponed, since P must also be constructed explic¬ 
itly. In most cases this cannot be achieved analytically, and one is finally left with a 
numerical integration of the differential equation of the system. It turns out, however, 
that the exact Poincare mapping P frequently has amazing common features with 
very simply constructed analytic discrete mappings. As an example we shall discuss 
the “logistic mapping” in Chap. 27. 

Let us return to the problem of stability of periodic paths. As was outlined in the 
preceding section, it is sufficient to investigate the small deviations $ (t) = x(t) — x r (t) 
from the reference trajectory in the linear approximation. In this approximation the 
Poincare mapping simplifies to a linear mapping , i.e., the multiplication by a ma¬ 
trix C: 

l„+i=c hence, x„ = C%. (24.10) 

Decisive for the long-term behavior of the deviation %(t) are the eigenvalues 
X \,..., Xn of the matrix C. If all eigenvalues satisfy the condition \kt\ < 1, the map¬ 
ping is contracting, and the sequence converges toward zero. In this case the periodic 
solution x r (t) is thus asymptotically stable. If at least one of the eigenvalues \k\ > 1, 
the perturbations are increasing along the direction of the associated eigenvector, and 
the path is unstable. 

In the subsequent example, the mathematical theory of the stability of periodic 
solutions developed by Floquet will be presented in more detail. 
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EXAMPLE 


24.1 Floquet’s Theory of Stability 

As described at the beginning of this chapter, we are interested in the long-term be¬ 
havior of the path deviations %(t) = x(t) — x r (t) which approximately obey a linear 
differential equation 

|-$(r) = (24.11) 

at 

with a periodic matrix of coefficients M (t + T) = M (t). Since we are dealing with 
a linear problem, any solution may be expanded in terms of a fundamental system 
of linearly independent basic solutions 0^(0 • The basic solutions are not 

uniquely determined, and for sake of clarity we choose them in such a way that at the 
time t = 0 (we might choose also t = to) they just coincide with the unit vectors in the 
A-dimensional space: 

0 1 (O) = (1,O, ...,0) T to 0^(0) = (0,0, ...,1) T , (24.12) 

where the transposition symbol T indicates that these vectors shall be column vectors. 
Geometrically all of these vectors are lying on a (hyper-) spherical surface of radius 
unity. The superposition of a general solution § (t) reads 

N 

$(!) = £> 0 /( 0 , (24.13) 

i =1 

which may also be written in matrix form: 

$(t) = Q(t)c. (24.14) 

c is a column vector formed out of the expansion coefficients, and O is an N x N- 
matrix containing one of the basic vectors in each column, 

<K0 = [0i(0. • ••,0 aKO] and c= (ci, ...,c w ) T . (24.15) 

Due to (24.12), the matrix 4> satisfies the initial condition 

0(0)= I. (24.16) 

How does the periodicity of the differential equation (24.11) manifest itself in the 
matrix O? To see this, one should realize that any solution § of (24.11) at the time 
t + T satisfies the same differential equation as at the time t . This does of course not 
mean that the solution will be periodic; in general ^{t + T) ^ §(f). But it may be 
expanded both in terms of the basic solutions 0, (£ + T ) as well as in terms of the 
0 /( 0 , 

$(t + T) = ^(t + T)c and also £(f + T) = <P(t)c'. (24.17) 


This implies a linear relation 
&(t + T) = ®(t)C. 


(24.18) 
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Example 24.1 


The constant N x N -matrix C is called the monodromy matrix. This quantity gov¬ 
erns how the solutions develop from one period to the next. Using the initial con¬ 
dition (24.16), one may immediately read off the value of the monodromy matrix 
from (24.18): 

C = fc(r). (24.19) 

Hence, the monodromy matrix may be calculated by integrating the differential equa¬ 
tion (24.11) N times with distinct initial conditions over a period from 0 to T and 
writing the resulting solution vectors <fii(T) into the columns. The evolution of the 
matrix 4>(0 for arbitrarily large times is obtained by iteration of (24.18). For full pe¬ 
riods in particular, we have 

$(2 T) = *(T) C = $(0)CC = C 2 , (24.20) 

and generally, 

Q(nT) = * n (T) = C n . (24.21) 

According to (24.14) and (24.21), the evolution of the solutions %(t) for large times 
is thus determined by the powers of the monodromy matrix C. What happens thereby 
may be read off from the N eigenvalues ki of this matrix, which are called character¬ 
istic multipliers or Floquet multipliers , 


0(7>* =ki(T)Ui. (24.22) 

If U; is an eigenvector of the matrix 4>(T), then it keeps this property for the iterated 
mapping as well, e.g., 


*(27>j = *(7’)*(7’)u i = *ki(T)Ui =^(Du,' 

= Xi(2T)Ui, etc. (24.23) 

This leads to the following functional equation for the eigenvalues of the iterated map¬ 
ping: 


k i (nT) = k n i (T). (24.24) 

This behavior is characteristic for the exponential function; i.e., (24.24) is solved by 

X l (T) = e (J ‘ T (24.25) 

with an (in general complex) constant <t* that is called the Floquet exponent. We still 
note that (24.21) may be considered a functional equation like (24.24), with the same 
kind of solution 

&(T) = e ST . (24.26) 

Here, a matrix S stands in the argument of the exponential function, and the resulting 
function value is again a matrix. Such a matrix function is mathematically defined 
simply through its power series expansion. One can show that the eigenvalues of the 
matrix S introduced by (24.26) are just the Floquet exponents a\ of (24.25). If one is 
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interested in the evolution matrix at any times (not only multiples of the period T ), 
then (24.26) still has to be generalized: 

$(f) = U(Oe Sr with U(0)=I. (24.27) 

The matrix U(0 may exhibit a complicated time dependence but must be periodic. 
Because of (24.18), we have 

U(; + T)e S(t+T) = $(f)$(r) = U(r)e s V r . (24.28) 


The product of the exponential functions on the right side may be combined to 
exp (SO exp (ST) = expS(£ + T) (for noncommuting matrices in the exponent this 
would in general not be correct), and we find 

U(f+ r) = U(f). (24.29) 


Hence, for the long-term behavior of the solutions §(0, U does not play a role. This 
behavior is only determined by the magnitude of the Floquet multipliers O' • Begin¬ 
ning with an eigenvector £ (0) = u;, this solution according to (24.24) will increase as 
§(nT) = u; exp (ainT). From that, we conclude the following: The trajectory x r (t) is 
asymptotically stable if for all Floquet multipliers we have |A;| < 1; he., Re Oi < 0. It 
is unstable if for at least one eigenvalue we have |A; | > 1; he., Re 07 > 0. 

These statements, which were obtained by linearizing the equation of motion, trans¬ 
fer also to the stability behavior of the nonlinear system. The limit of marginal stability 
|OI = l may be cleared up only by additional investigations. 

For an autonomous periodically vibrating system, a peculiarity arises: In this case, 
one of the eigenvalues always has the value A = 1 and must not be considered in the 
stability analysis. To prove this assertion, we consider the function x r (t). The mode 
under consideration is namely the motion tangential to the reference orbit. For an 
autonomous system, one obtains by differentiating the nonlinear equation of motion 


x r (t)= F(x r ) 


x r (t) = 


3F 

3x 


x r = M (t)x r , 

x r 


(24.30) 


which agrees with the linearized equation of motion (24.16). The time evolution of the 
solutions of this differential equation is determined by the matrix 4>(0; hence, 


X r (t) = O(0x r ( 0). 


(24.31) 


The reference orbit x r (t) and therefore also its derivative x r (t) are however (contrary 
to the case of general perturbations §( 0 ) periodic; hence, 

x r (T) = 4>(T)x r ( 0) =x r (0), (24.32) 


which proves that the monodromy matrix has an eigenvector, namely, x r ( 0 ), with the 
eigenvalue A = 1 . This is vividly clear: A reference orbit that is shifted in the tangential 
direction simply corresponds to a shift of the time coordinate t t + 8t. Since the 
absolute value of the time does not play a role in autonomous systems, x r {t) and 
x(t) = x r (t + 8t ) are always running with unchanged distance one behind the other. 
Hence, the associated Floquet multiplier must have the value unity. 


Example 24.1 
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EXERCISE 


24.2 Stability of a Limit Cycle 


Problem. Let a nonlinear system be described by the following equation of motion: 


x = ~y+x(p-(x 2 + y 2 )), 
y=x + y(p- (x 2 + /)). 


(24.33) 


Investigate the stable solutions and find the Floquet multipliers of the limit cycle. 

Solution. A stationary fixed point exists at xo = (0, 0) T . Its stability is governed by 
the Jacobi matrix (24.7) 


3F 

M(x) = — = 
3x 


p — 3x 2 — y 2 


1 — 2xy 

which at the fixed point xq takes the form 


— 1 — 2xy 
p — 3 y 2 — x 2 I ’ 


g -;)■ 


M(X 0 ) : 

The two eigenvalues are according to Example 23.1: 
P'1/2 = S± S 2 — d = p± yj P 2 ~ (P 2 + 1) = P ± i. 


(24.34) 


(24.35) 


(24.36) 


Hence, for p < 0 one has a stable spiral and for p > 0 an unstable one; p — 0 repre¬ 
sents the special case of a rotor. 

By inspecting (24.33), one immediately finds a periodic solution for p > 0, since 
for constant x 2 + y 2 = p the system reduces to a harmonic oscillator: 

x r (t) = (.y/pcos t , «/psint) T . (24.37) 


The Jacobi matrix (24.12) evaluated at the limit cycle (24.37) reads 


M(0 = M(x r ) = 


G 


— 2 p cos 2 1 
— 2p sin t cos t 


-1 — 2 p sin t cos t 
—2 p sin 2 1 


(24.38) 


For the linearized system of equations (24.33) with this matrix M(f), the normalized 
fundamental solutions (24.12) from Example 24.1 may be given explicitly. One finds 


<t> l {t) = e~ 2pt 


( COS A 

ysin t) 


and 



(24.39) 


Combining these vectors to the matrix 0(0 and evaluating at T =2n leads to the 
monodromy matrix 


C = 0(T) = 


/ e~ Anp 

V 0 



(24.40) 


Hence, the basic solutions (24.39) are also already eigenvectors of the monodromy 
matrix, with the eigenvectors 

X\ = e~ Anp and X 2 = 1 . 


(24.41) 
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As expected, one of the Floquet multipliers has the value unity (the corresponding 
eigensolution 0 2 (O * s tangential to x r (t)). The value determines the stability of the 
limit cycle: It is asymptotically stable, since Ai < 1 for p > 0. For p < 0, no limit 
cycle exists. 

The nonlinear system (24.33) is so simple that it allows also a closed analytic so¬ 
lution. We change to polar coordinates x — r cos cp, y = r sirup. The differential equa¬ 
tions (24.33) lead to the decoupled system 

f=r(p —r 2 ), 0 = 0. (24.42) 


Hence, the angle simply increases linearly with time, (p(t) = t + <po- The radial equa¬ 
tion may be integrated as follows: 


r dr _ r l 

Jrn r (p- r 2 ) Jo 


n 0 r(p ~ r 2 ) 
or solved for r 


dt 


1 r z 

t . r ln -2 

2 p p — r z 


(24.43) 


ro 



For p > 0, the solution asymptotically approaches the limit cycle r(t) 
P < 0, 


(24.44) 
VP- For 


Exercise 24.2 



(24.45) 


any trajectory asymptotically spirals to the stable fixed point ro = 0. 











Bifurcations 


25 


As a rule, the behavior of dynamical systems is influenced by the value of one or 
several control parameters /z. These may be the strength of an interaction, the magni¬ 
tude of friction, the amplitude and frequency of a periodic perturbation, or many other 
quantities. One then frequently observes that the long-term behavior of the trajecto¬ 
ries changes qualitatively when passing a critical value /z > /z c of the parameter. For 
example, instead of a stable equilibrium position, suddenly two such positions may 
occur, or a system initially at rest may begin to oscillate. The phenomenon of addi¬ 
tionally arising solutions or of solutions that suddenly change their character is called 
branching or bifurcation ; the value /z c is called the branching value. 

The general theory of bifurcations is difficult and not yet completely worked out 
mathematically. Here we shall confine ourselves to easily tractable but important 
cases. This means for the present that we consider only local bifurcations, where the 
behavior of the system in the neighborhood of an equilibrium solution is changing. 
There exist also global bifurcations; here the topological structure of the solutions 
is modified “on a large scale,” e.g., the shape of the ranges of attraction of attractors. 
Furthermore, we consider only the one-dimensional case, which means the bifurcation 
shall arise when varying a single control parameter. (The phase space of the dynamical 
system, however, may be multidimensional.) Even under these restricting conditions a 
series of various types of bifurcations are possible. In the following, these types shall 
be classified and illustrated by simple examples. 


25.1 Static Bifurcations 

For the present, we concentrate on the stability of stationary fixed points xo character¬ 
ized by 

x = F(x 0 , /z) = 0. (25.1) 

This may be interpreted as an implicit equation for the position of the fixed point de¬ 
pending on the parameter /z, xo = xo(/z). A premise for the existence and continuity of 
this function is, according to the theorem from analysis on implicit functions, a non¬ 
singular Jacobi matrix M = 3F/3x| Xo . A discontinuous behavior, thus bifurcations, 
may therefore be expected if the determinant of M vanishes; this means if one of the 
eigenvalues of this matrix depending on /z takes the value zero. The meaning of the 
eigenvalues of the Jacobi matrix for stability has been discussed in Chap. 24. 

We now consider the typical cases in the simplest possible form, namely for a one¬ 
dimensional system. Without restriction of generality the fixed point is set to xq = 0, 
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and let the branching value be /z c = 0. This may always be achieved by appropriate 
coordinate transformations. Moreover, the one-dimensional bifurcation may be em¬ 
bedded into a higher-dimensional space. 

(a) The saddle-node branching: Saddle-node branching occurs in the dynamical 
system 


x = F{x, fi) — ii — x 2 . 


(25.2) 


This system has fixed points at 
*oi = +v^, *02 = 


(25.3) 


But, since x is real, there is no fixed point for negative \jl. If /r passes the critical 
value fi c = 0, the number of fixed points jumps from 0 to 2. The lower fixed point is 
however unstable , as is shown by the linear stability analysis according to Chap. 23. 
The eigenvalue y of the Jacobi “matrix” 


dF 
M= — 
dx 


XQ 


—2vo, thus, y = —2xo, 


(25.4) 



Fig. 25.1. Stable {solid) and 
unstable {dashed) fixed points 
for the saddle-node branching 


is, namely, positive for the solution X 02 = — *JJl. Figure 25.1 shows the stable (solid 
curve) and unstable (dashed curve) fixed points as a function of the control parameter 
ti. The arrows indicate the direction of motion, which may be immediately read off 
from (25.2). One may state that a stable and an unstable solution meet each other at 
the critical point and annihilate each other. 

We still have to clarify the origin of the name saddle-node branching. For this pur¬ 
pose, the branching is embedded in a two-dimensional space, which may be achieved, 
e.g., by 

X = IX -x 2 , 


The variables x and y are decoupled, and the solutions y{t) tend asymptotically to 
zero. The fixed points are xoi = (+ v / 7^, 0) and X 02 = (—0). The Jacobi matrix 
has the form 


(dFi/dx dFi/9y\ _2x 0\ 
\dF 2 /dx dF 2 /dy) XQ ~ \ 0 -1 ) 


(25.6) 


and is, of course, diagonal, with the eigenvalues y\ = — 2xo, yi = — 1. The two eigen¬ 
values for the solution X 02 are 2^/jZ and —1; i.e., they have distinct signs, which— 
according to the nomenclature from Example 23.1—corresponds to a saddle point. 
For xo = +y/jZ, both eigenvalues are negative, and there arises a stable node. Saddle 
and node coalesce with each other at the critical point. The distinct dynamical flows 
at /z < 0, /z = 0, pi > 0, are represented in Fig. 25.2. One clearly notes that after coa¬ 
lescing of saddle and node, there is no longer a fixed point and thus the flow continues 
to infinity (left diagram). 
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Fig. 25.2. The dynamical flow 
at a saddle-node branching 
which is embedded in a two- 
dimensional space 


(b) The pitchfork branching: The simplest example of this kind of branching arises 
if one uses a cubic polynomial for F{x, fi): 

x = F{x, j±) = /ix — x 3 . (25.7) 


Since this polynomial originates from (25.2) by multiplying by the factor x, a zero as 
solution simply adds to the former fixed points, 

*oi = +V/A *02 = -yfji, *03 = 0. (25.8) 

At the critical point fi c = 0, the number of fixed points therefore jumps from 1 to 3, 
whereby one of the latter solutions turns out to be unstable. The Jacobi matrix 

= IX-3xl (25.9) 

X 0 

has for the three fixed points the eigenvalues 

Yi = Y2 = -2 ix 2 , y 3 = ix. (25.10) 


dF 


dx 


Thus, the solution X 03 is stable below /z c , but looses this property at the critical point 
to the two branches xoi and X 02 of the “fork,” as is represented in Fig. 25.3. As com¬ 
pared with Fig. 25.1, the arrows representing the direction of flow have changed their 
orientation in the lower half-plane x < 0. This is obvious because F{x, /x) contains 
the additional factor x. 
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Fig. 25.3. Stable {solid) and 
. ^ unstable {dashed) fixed points 

^-^*7 of the pitchfork branching: 

, ^ (a) A supercritical branching. 

, \ (b) A subcritical branching 
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The pitchfork bifurcation may still arise in a second version, with the stability prop¬ 
erties exactly inverted. An example for that is 

x = F{x, fi) = fix + x 3 , (25.11) 

which differs from (25.7) by the sign. Since the flow direction and the signs of the 
eigenvalues are inverted, the branching diagram changes as represented in Fig. 25.3(b). 
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In this case, there remains only a single stable branch. The pitchfork bifurcation of 
Fig. 25.3(a) is supercritical , and that of Fig. 25.3(b) is subcritical. The two remain¬ 
ing combinations of signs of the linear and cubic term in F{x, pi) do not yield any 
qualitatively new features, they just correspond to the reflection pi —> —pi. 


(c) The transcritical branching: In this case, we consider a polynomial with linear 
and quadratic term 

x = F{x, pi) = fix — x 2 (25.12) 



Fig. 25.4. Stable {solid) and 
unstable {dashed) fixed points 
of the transcritical branching 


with the fixed points 

*01 =/x, x 0 2 = 0. (25.13) 

Thus, there always exist two fixed points in the entire parameter space. The eigenval¬ 
ues of the stability matrix are 

Ki = -/x, n = p. (25.14) 

In each case, one of the solutions is stable, the other one is unstable. At the branching 
point the two solutions change their roles; see Fig. 25.4. 


(d) The Hopf branching: The branchings considered so far were characterized by 
the fact that a real eigenvalue of the Jacobi matrix takes the value zero when varying 
the control parameter. However, branchings may also occur for complex eigenvalues. 
Since the eigenvalues always occur in the form of complex-conjugate pairs, the sys¬ 
tem must have at least the dimension 2. As an example, we consider the system of 
equations 


x = -y+x(n - (x 2 + y 2 )), 
y =x+y(n-(x 2 +y 2 )). 


(25.15) 


This system has been investigated already in Exercise 24.2 in the context of the sta¬ 
bility of limit cycles. For all values of pi the origin is a fixed point, xo = (0, 0). At this 
position the Jacobi matrix has the value 




(25.16) 


with the eigenvalues y\ = pi + i, y 2 = pi — i. This means that the fixed point for 
pi < 0 is a stable spiral, and for pi > 0 an unstable spiral. The fate of the stable 
solution at the critical point differs from that in the cases considered so far. From 
the stationary attractor xo there evolves for pi > 0 a periodically oscillating solution 
x r {t) = {^jlcost, ^/JZsint), which turns out to be a stable limit cycle. 

As is seen from Fig. 25.5, the bifurcation diagram resembles that of the pitchfork 
branching, Fig. 25.3. Actually, the system of (25.15) may be decoupled by changing 
to polar coordinates v = r cos0, y = r sin0. The equation of motion for the radius 
r{t) then exactly coincides with (25.7); see (24.42) in Exercise 24.2. The new state is, 
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Fig. 25.5. For a Hopf branch¬ 
ing, a fixed point converts into 
a limit cycle 


however, independent of time only in a “convected rotating” coordinate system, since 
the phase angle increases linearly, (p(t) = t. In the original phase space the behavior 
of the attractor therefore changes qualitatively: A static solution becomes a dynamic 
solution. Branchings of this kind were first investigated in a systematic way 1 in 1942 
by the mathematician Eberhard Hopf. 2 

As for the pitchfork branching, there are also two versions of the Hopf branching. 
In addition to the supercritical case represented here, there exists also a subcritical 
Hopf bifurcation with the opposite stability properties. In practice this version is of 
less interest, due to the fact that no stable limit cycle arises as an attractor. 

One might consider the branchings presented in (a) to (d) as particularly simple 
special cases. But it has been shown under rather general premises that the change of 
stability (characterized by the zero passage of eigenvalues of the Jacobi matrix) for 
variation of a single control parameter always proceeds according to one of these four 
scenarios. We cannot deal here with the more detailed mathematical foundation, but 
refer for an introduction, e.g., to G. Faust, M. Haase and J. Argyris, Die Erforschung 
des Chaos , Vieweg, 1995, Chap. 6. 3 


25.2 Bifurcations of Time-Dependent Solutions 

It may also happen for periodic trajectories that their character changes stepwise un¬ 
der variation of a parameter p. We shall only briefly touch on the bifurcation theory 
of periodic solutions and vividly illustrate some interesting aspects. The mathematical 
tool is the Poincare mapping introduced in Chap. 24. A periodic orbit x r (t) is charac¬ 
terized by a fixed point x r o in the Poincare cut. The discretization of the neighboring 
path x(t) consists of a sequence of points xo, xi, X 2 ,.... For the distance between the 
two orbits, we have according to (24.10) 


- x r0 = C n (x 0 - x r0 ), 


(25.17) 


1 E. Hopf, Abh. der Sachs. Akad. der Wiss., Math. Naturwiss. Klasse 94, 1 (1942). 

2 Eberhard Friedrich Ferdinand Hopf, b. April 17, 1902, Salzburg-d. July 24, 1983, Bloomington, 
Indiana. Hopf studied mathematics in Berlin and taught at MIT (1932 to 1936), and at the universities 
of Leipzig (1936 to 1944) and Munich (1944 to 1948), as well as at Indiana University, Bloomington 
(from 1948). His main research fields were differential and integral equations, variational calculus, 
ergodic theory, and celestial mechanics. 

3 This was also translated into English and published as G. Faust, M. Haase and J. Argyris, An 
Exploration of Chaos, North-Holland (1994). 
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25 Bifurcations 


Fig. 25.6. Various kinds of ap¬ 
proaches to a fixed point 


Fig. 25.7. Typical branchings 
of a periodic trajectory 
(dashed) 


where the matrix C represents the linearized approximation of the Poincare map¬ 
ping P. As long as all eigenvalues A; of C fall into the complex unit circle, |A; | < 1, 
the neighboring orbits are attracted and the solution x r (t) is a stable limit cycle. 

The approach x n x r o may proceed in different ways, as will be illustrated for a 
selected eigenvalue, say, Ai. If the eigenvalue is real and positive, 0 < Ai < 1, the x n 
approach the fixed point x r o monotonically, as is shown in Fig. 25.6(a). The axis in 
this diagram corresponds to the direction of the eigenvector of C belonging to Ai. If 
the eigenvalue is real and negative, —1 < X\ < 0, the x n form an alternating sequence; 
see Fig. 25.6(b). Finally, for a pair of complex eigenvalues, X\ = A|, |Ai \ < 1, the x n 
lie on a spiral converging toward x r o; see Fig. 25.6(c). 


0 < X x < 1 


-1 < Ai<0 


IA 1 I = IA 2 I<1 


2 * 


43 2 1 0 


1 3 4 2 0 


(a) 


(b) 


(c) 


Branchings occur if an eigenvalue A leaves the unit circle at a critical value of the 
control parameter /z c . One distinguishes three possible cases: 


(a) A i — +1. 

According to (25.17), in this case the distance of a neighboring trajectory from the 
reference trajectory does not change. (A deviation along the direction of the eigen¬ 
vector of the matrix C is multiplied by Ai = 1 for each cycle.) This indicates that 
for /x > /z c , new limit cycles may arise that have the same period length T as the ref¬ 
erence orbit x r . Similar to the bifurcation of a stable fixed point, a periodic solution 
may also undergo a pitchfork bifurcation and split into two separated solutions. This is 
sketched in Fig. 25.7(a). The other bifurcations from the last section are also possible. 
Which of these cases is actually realized cannot be read off from the criterion Ai = +1 
alone. 


(b) |Ai| = |A 2 | = 1. 

In this case as well, the sequence of points x n of the Poincare mapping remains 
at a constant distance from x r o but now rotates on a circle. In the Poincare cut itself, 
a limit cycle evolves. This means geometrically that the topology of the periodic or¬ 
bit changes: It now lies on a torus which envelops the originally closed orbit, as is 
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shown in Fig. 25.7(b). If the two circulation frequencies on the torus mantle are in¬ 
commensurable, i.e., do not form a rational ratio p/q, p,q e N, then one speaks of a 
quasiperiodic motion , since the orbit for infinitely large times never closes. It thereby 
approaches any point on the torus surface to arbitrarily close distance. 


(c) k\ = -1. 

This case is of particular interest, since the distance of the points x n remains con¬ 
stant, but the direction is alternating. This means that the neighboring orbit x after each 
second passage returns to its old position. There arises a periodic orbit with twice the 
period length 2 T, as is indicated in Fig. 25.7(c). The phenomenon is therefore called a 
period-doubling or subharmonic bifurcation. Bifurcations of this kind play an impor¬ 
tant role in the transition from periodic to chaotic motion. An explicit example will be 
discussed in Example 27.1 in the context of logistic mapping. 



Lyapunov Exponents and Chaos 


26 


In Chap. 24, the concept of stability of time-dependent orbits was discussed, and in 
Example 24.1, Floquet’s theory of stability, which may be applied to periodic paths, 
was explained in more detail. Building on the works of Floquet and Poincare, the 
Russian mathematician Fyapunov 1 published in 1892 an even more general study of 
the stability problem in which arbitrary and also nonperiodic motions were admitted. 
The characteristic exponents introduced by Fyapunov have played a central role in the 
theory of nonlinear systems. 


26.1 One-Dimensional Systems 

We first consider the case of a one-dimensional discrete mapping 

x„+i=f(x n ). (26.1) 

Physically, this may be for example the Poincare mapping of a dynamical system. We 
now ask, how does the point sequence xo, x\, X 2 ,... differ from the point sequence 
x 0 ,xi,x 2 ,... that evolves from a slightly modified initial condition xo = xo + <$xo? 
We have, in general, 

x n = x n + 8x n = f{x n -1 + 8x n -i ) 

= f(x n - 1) + f'(x n -i)8x n -1 4-; (26.2) 

thus, the deviation in the nth step in linear approximation is 

8x n = f(x n -i)8x n - 1 . (26.3) 

This equation can be applied n times recursively, with the result 

8x n = f'(x n -i)f'(x n - 2 ) • • • fix o) 8xo (26.4) 

n— 1 

= Y\f'(x l )8x 0 . (26.5) 

1=0 


1 Alexander Mikhailovich Lyapunov, Russian mathematician, b. June 6, 1857, Yaroslavl-d. Novem¬ 
ber 3, 1918, Odessa. Lyapunov was a scholar of Chebyshev. He investigated the stability of equi¬ 
librium and the motion of mechanical systems and the stability of rotating liquids. Lyapunov also 
worked in the fields of potential theory and probability theory. 


W. Greiner, Classical Mechanics, 

DOI 10.1007/978-3-642-03434-3_26, © Springer-Verlag Berlin Heidelberg 2010 


503 






504 


26 Lyapunov Exponents and Chaos 


Obviously, the values f'(xi ) are a measure of how fast the neighboring solutions x n 
and x n go away from each other (or move towards each other). In the special case of 
a periodic motion which was studied by Floquet the point sequence jc/ (interpreted as 
a Poincare mapping) is constant and all factors in (26.4) have the same value. This 
yields the exponential relation 

\8x n \ = \f'(xo)\ n \8xo\=e™\8x 0 \ with a = In |/'(* 0 )l, (26.6) 


compare (24.25) in Example 24.1. (A difference in the definition of a is that in (26.6) 
one considers magnitudes. This implies that a here is always real.) The characteristic 
exponent a determines the “growth rate” of a perturbation. Even if the points X[ differ 
from each other (nonperiodic orbit), the exponent a may be further used by form¬ 
ing a mean value. The mathematical definition of the Lyapunov exponent of a point 
sequence x n is 


1 

o = lim lim - In 

n—>oo 8xq~^ 0 n 


8x n 

8xo 


(26.7) 


Using (26.4) and the multiplication rule for the logarithm, we can also write this as 


it X 

a = lim - V In |/'(*/) I- (26.8) 

n—>oo n ^— 4 
1=0 

If all xi are equal, the quantity a of (26.7) or (26.8) obviously reduces to the special 
case (26.6). 

The Lyapunov exponent is a logarithmic measure for the mean expansion rate per 
iteration (i.e., per unit time) of the distance between two infinitesimally close trajec¬ 
tories. 

The case o > 0 is of particular interest. A dynamical system with a positive Lya¬ 
punov exponent is called chaotic. The paths of such a system are extremely sensitive 
to changes of the initial conditions. Because of the exponential dependence there is 
no need for long waiting, and an initially small deviation 8x o explodes to arbitrary 
magnitude. More strictly speaking, it is sufficient that the product ncr be a number not 
very much larger than unity; compare (26.6). 

This property of chaotic systems is very significant, both practically as well as con¬ 
ceptually. The behavior of a chaotic system is not predictable , at least not over a long 
time period. Since physical quantities can always be determined only with a limited 
precision, <5vo inevitably has a value that differs from zero. Therefore, it is hopeless 
to aim at predicting the state of a chaotic system for times that are significantly larger 
than 1 /a. The attempt to reach that by more and more precise fixing of the initial 
conditions is doomed to failure. Ultimately, the exponential increase of the deviation 
will always win. 

The fascinating point is that this effect arises in a completely deterministic system. 
The dynamics of such a system is “in principle” mathematically uniquely fixed by 
the basic equation of motion—be it a differential equation or a discrete mapping as 
in (26.1). Nevertheless, an exact knowledge of the equation of motion cannot help in 
the attempt to find the solutions. In this way a new kind of uncertainty is brought into 
physics, in addition to the more familiar sources of the statistical fluctuations (noise) 
and the quantum fluctuations (uncertainty relation). The first to clearly understand and 
state this phenomenon was Henri Poincare toward the end of the nineteenth century. 
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Considering the rapid development of other branches of physics, nonlinear dynamics 
has lived in the shadows for a long time. The current interest in the chaos theory 
has been inspired significantly by the proliferation of electronic computing facilities, 
which allow for exploring the dynamics of systems resisting the analytic treatment. 

An important impetus for dealing with chaotic systems was given by the meteorol¬ 
ogist Edward N. Lorenz, whose work 2 received little attention for a long time. In 1963, 
he derived from the basic equations of hydrodynamics, under strongly simplifying as¬ 
sumptions, a system of three coupled nonlinear differential equations for describing 
a meteorological system. When working on their numerical solution, he discovered 
the signature of deterministic chaos: nonpredictability due to sensitive dependence on 
the initial conditions. Nobody will be surprised that meteorology offers an appropriate 
example for that phenomenon. Finally, weather may be predicted reliably even with 
massive computer effort for at most a few days in advance. The attempt to extend this 
space of time requires an exponentially increasing effort. 

Lorenz has also coined the concept of the “butterfly effect” in this context, which 
entered public awareness: The flapping of wings of a butterfly in Brazil decides 
whether Texas is hit by a tornado some days later. Of course one should not just flog 
this picture to death. Even all butterflies together cannot change the structure of the 
“climatic attractor” by their flapping. Tropical whirlwinds will occur over and over 
again, at least as long as no global change in climate happens. But at what time and at 
which place a thunderstorm occurs depends, however, so sensitively on subtle details 
that even a butterfly may affect it. 


26.2 Multidimensional Systems 

The definition of the Lyapunov exponent given here for one-dimensional systems with 
discrete dynamics may be generalized to several dimensions and to a continuous time 
evolution. The discussion of the stability of periodic orbits in Chap. 24 may serve as 
a starting point. There we have discussed trajectories x(t) = x t {t) + %(t) that differ 
from a reference orbit x r (t) by a small perturbation %(t). To first approximation the 
perturbation obeys a linear differential equation; compare (24.6), 


$(0 = M(0$(0, 


(26.9) 


with the Jacobi determinant M = 3F/3x| Xr (f). In agreement with (26.7), as a measure 
for the time evolution of the perturbation one may define the quantity 



CT Xr , ? = lim In 


(26.10) 


This definition of the Lyapunov exponent raises several mathematical problems that 
can only be touched on here. First of all, cr% r ^ 0 depends on the reference trajectory 
x r (t) and therefore on the position of the starting point. But if the system has an at¬ 
tractor, then the value of a in the range of attraction of the attractor is independent 
of the reference orbit. Moreover, there exists the important class of ergodic systems 
for which the mean values with respect to the time (taken along an orbit) can be re¬ 
placed by mean values in phase space. One can show also for ergodic systems that 


E.N. Lorenz, J. Atmos. Sci. 20 , 130 (1963). 


2 
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the Lyapunov exponents defined according to (26.10) exist and are independent of the 
special reference trajectory. (More strictly speaking, there may occur “pathological” 
orbits with differing a, but these form a set of measure zero ). 3 

Furthermore, the value of a Xr ^ () depends on the direction of the perturbation 
§0 = In an N -dimensional space, one may construct N linearly independent 
vectors e* that lead to a set of N Lyapunov exponents 

cr i =cr x r ,ei • (26.11) 

The indices are chosen such that the 07 are in descending order, 


o\ > ^2 > • • • > cr N , 


(26.12) 


where degeneracies (equality of several values) are possible. 

The value of the largest Lyapunov exponent o\ is the most easily determined one. 
For this purpose the linearized equation of motion (26.9) is integrated with an initial 
perturbation § (to) chosen at random. Such a vector may be decomposed according to 

N 

= (26.13) 

i =1 

and will always have also a component along the vector ei. When tracing the solution 
over a sufficiently long time interval, the most rapidly increasing component of the 
perturbation (or, if all 07 are negative, the most slowly decreasing component) will 
dominate. Performing the limit in (26.10) then guarantees that the calculation yields 
the maximum Lyapunov exponent. In practice one has to take care in such a calcula¬ 
tion that the trajectory x(t) for chaotic systems moves away from the reference trajec¬ 
tory x r (t) very rapidly. It is therefore recommended that one performs a rescaling of 
the perturbation (t n ) = c%(t n ) in regular time intervals, with a constant c 1 ; see 
Fig. 26.1. The value of a is then obtained by averaging over many time intervals. 


Fig. 26.1. In the numerical 
calculation of Lyapunov 
exponents one performs a 
rescaling in regular time steps, 
because of the divergence of 
the trajectories 



The full set of the N Lyapunov exponents may be calculated by following the time 
evolution of all N linearly independent perturbations § z - with § z (fo) = e,-. From the 
N volumes of the parallelepipeds spanned by the £ 1? § 2 > • • • (that must be 
calculated for all values p = 1,2,..., N), one then may obtain successively all 07. 4 

We still note that for periodic orbits x r (t + T) = x r (t), the Lyapunov coefficients 
coincide with the real part of the Floquet exponents introduced in Example 24.1. Thus, 
one has a generalization of this concept. 


3 More information on these questions may be found, e.g., in D. Ruelle, Chaotic Evolution and 
Strange Attractors, Cambridge University Press, Cambridge (1989). 

4 G. Bennettin, C. Froeschle and J.P. Scheidecker, Phys. Rev. A19, 2454 (1979); T.S. Parker and 
L.O. Chua, Practical Numerical Algorithms for Chaotic Systems, Springer, New York (1989). 
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The Lyapunov exponents are decisive for the long-term evolution of a dynamical 
system. As discussed already, positive a imply a rapid divergence of neighboring 
trajectories and nonpredictability. Of particular interest are trajectories which attract 
neighboring solutions and have (at least) one positive Lyapunov exponent. They are 
called chaotic attractors. 

Let us consider for illustration an autonomous system with three degrees of free¬ 
dom. Depending on the combination of signs of ( o\ , 02 , 02 ), various kinds of attractors 
may occur: 


(cr\, < 72 , <^ 3 ) 


(---), 
( 0 ,-,-), 
(0, 0 , -), 
(+> 0 , —), 


fixed point, 
limit cycle, 
torus, 

chaotic attractor. 


These cases are illustrated in Fig. 26.2. 



Fixed point 
(-, - -) 



Torus 

( 0 , 0 , -) 


Fig. 26.2. Distinct types of at¬ 
tractors in a dynamical sys¬ 
tem with three degrees of free¬ 
dom. The pictures differ by 
the signs of the Lyapunov ex¬ 
ponents 

Limit cycle 

( 0 , -) 




Chaotic attractor 

(+, 0, -) 


(a) If all Lyapunov exponents are negative, there arises a stable fixed point to which 
the neighboring trajectories from all directions are converging. 

(b) The vanishing of a Lyapunov exponent, o\ = 0, indicates the existence of a pe¬ 
riodic motion. This has been demonstrated explicitly in Chap. 24, based on the 
equation of motion. The vector ei associated with o\ points along the direction 
of the tangent of the orbit. The attractor is a limit cycle , i.e., a one-dimensional 
object with the topology (but not necessarily the geometric shape) of a circle. 

(c) If two of the Lyapunov exponents vanish, o\ = 02 = 0, there exists a periodic 
motion in two directions. Therefore the attractor is two-dimensional and has the 
topology of a torus about which the trajectory is winding up. Whether or not the 
trajectory is periodic in total depends on the circulation frequencies oo\ and 002 for 
the two degrees of freedom of the torus. If the values oo\ and 002 are incommen¬ 
surable, i.e., the ratio 001/002 is not a fraction of integer numbers but an irrational 
number, the orbit will never close. Such an orbit that with increasing time covers 
the torus more and more densely is called quasiperiodic. 
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(d) If the largest Lyapunov exponent is positive, o\ > 0, there arises a chaotic attrac¬ 
tor with the already discussed properties of irregular motion that depends strongly 
on the initial conditions. The typical combination is o\ > 0, 02 = 0, 03 < 0, but 
chaotic attractors with several positive Lyapunov exponents may also occur. Geo¬ 
metrically the chaotic attractors usually are also strange attractors , as mentioned 
already in Chap. 23. They have the strange property that they are objects with 
a broken dimension. These are neither lines nor surfaces (or higher-dimensional 
hypersurfaces) but “something in between.” That such objects are not pure math¬ 
ematical inventions but also occur in nature has been noted only recently, in par¬ 
ticular by B. Mandelbrot, 5 who named them fractals . 6 A beautiful example of 
a fractal attractor will be given in Example 27.4 in the context of a periodically 
driven pendulum; compare Fig. 27.19. 


26.3 Stretching and Folding in Phase Space 

How can a strange attractor with infinitely branched internal structure evolve in a 
dynamic system? The answer lies in the mechanism of the stretching and folding in 
phase space, which may be visualized as follows. 

We consider a dynamical system with a positive Lyapunov exponent o\. Moreover, 
let the system be organized in such a way that the accessible part of phase space is 
restricted (in Chap. 23 this property was included in the definition of attractors). We 
now consider a small connected domain AV in phase space, e.g. a cube, and fol¬ 
low its deformation under the phase flow . Neighboring points rapidly drift apart 
in the direction belonging to the Lyapunov exponent o\. Since the phase-space vol¬ 
ume is conserved in conservative systems, or even shrinks in dissipative systems (see 
Chap. 23), a contraction must take place in the other directions. Since on the other 
hand the available domain of phase space is restricted, the trajectories must bend back 
again. The test volume A V which is stretched at first will thus be folded back, as is 
schematically shown in Fig. 26.3. 

Fig. 26.3. A strange attractor 
may develop if the dynamical 
flow continuously stretches a 
domain in phase space and 
then folds it back again 

If one follows the history of AV still further, the game of stretching and fold¬ 
ing will continue again and again. One may imagine that in this way a infinitely fine 



5 Benoit B. Mandelbrot, b. November 20, 1924, Warsaw. After the emigration of his family to France 
(1936), Mandelbrot studied in Lyon, at the California Institute of Technology, and in Paris, where he 
did his doctorate in 1952. He worked at CRNS, in Geneva, and at the Ecole Polytechnique before 
he went in 1958 to the IBM Watson Research Center, where he was appointed as Research Fellow. 
He served as visiting professor among others at Harvard and Yale. Mandelbrot’s interests are extra¬ 
ordinarily broad and oriented interdisciplinarily. Building on the work of G.M. Julia (1893-1978) on 
iterated rational functions, he demonstrated the properties of fractals using computer graphics and 
pointed out their manifold occurrence in nature. Besides many other awards, Mandelbrot received the 
Wolf Prize in physics in 1993. 

6 For more on this point, see, e.g., B. Mandelbrot, The Fractal Geometry of Nature, Freeman (1982); 
H.-O. Peitgen, H. Jurgens and D. Saupe, Chaos and Fractals: New Frontiers of Science, Springer 
(1992). 
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subdivided puff-pastry-like structure develops, with the geometrical properties of a 
fractal. Two initially closely neighboring points end up after a while on two dis¬ 
tinct layers of the “pastry,” after which any recognizable correlation between their 
positions is lost. A simple mathematical model for the mechanism described here is 
the baker transformation (called so in analogy to the kneading of pastry); see Exam¬ 
ple 26.1. 

26.4 Fractal Geometry 

The long-term behavior of the trajectories of dissipative systems is characterized by 
the approach toward attractors that are embedded in the phase space as geometric 
forms of lower dimension. The simplest cases include fixed points (dimension D = 0), 
limit cycles (D = 1), and tori (D = 2) for quasiperiodic motion with two incommensu¬ 
rable frequencies. As new types of attractors in nonlinear systems, strange attractors 
with more complicated geometric structure may arise which are characterized by a 
broken dimension. The first fractals of this type, still without this name, were intro¬ 
duced by mathematicians as abstract constructions. For a long time, they seemed to 
belong into the mathematical cabinet of oddities, until it was discovered that fractal 
structures actually frequently occur in nature. 

The geometric construction of a fractal is usually based on a (mostly simple) it¬ 
eration rule that is applied repeatedly. In the limit of infinitely many iteration steps, 
the fractal arises which thus has an infinitely fine resolved internal structure. We shall 
briefly consider several familiar examples. 

(1) The Cantor set: The construction begins with a line, namely, the set of all real 
numbers in the unit interval 7o = [0, 1]. The iteration rule reads: Remove the mean 
third in each interval. In the first iteration step there arise two disjunct partial inter¬ 
vals I\ = [0, 1/3] U [2/3, 1] which then split further into I 2 = [0, 1/9] U [2/9, 1/3] U 
[2/3, 7/9] U [7/9, 1] etc. The first iteration steps are represented in Fig. 26.4(a). In the 
limit n 00 from the I n , there arises the Cantor set 7 as a kind of finely distributed 
dust of points in the unit interval. 

To get a measure of extension of the Cantor set, we consider the magnitude of its 
complementary set, i.e., the total length of all parts cut out. This leads to a geometric 
series: 



(26.14) 


7 Georg Cantor , German mathematician, b. March 3, 1845, St. Petersburg, Russia-d. January 6, 
1918, Halle. Cantor studied at the universities of Zurich and Berlin under WeierstraB, Kummer, and 
Kronecker. From 1869 to 1913, he was a professor at Halle. Cantor is the founder of set theory. 
He invented the notion of cardinal numbers and the concept of infinite (transfinite) numbers and dealt 
with the definition of the continuum. He proved the nondenumerability of real numbers. Furthermore, 
he contributed to the theory of geometric series. 
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Fig. 26.4. (a) Iterative con¬ 
struction of the Cantor set: In 
each step, the mean third of 
an interval is cut out. (b) Iter¬ 
ative construction of the Koch 
curve. It originates by contin¬ 
ued adding of triangles 
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II II II II II II II II 
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(a) 


0 



(b) 


The parts cut out thus add up to the total length of the unit interval, and the length 
of the Cantor set is therefore zero! But it consists of infinitely many points, and 
one can show that its cardinal number (power) is the same as that of the real num¬ 
bers. 8 


(2) The Koch curve: This construction again begins with a straight line of length 
unity. Instead of cutting out parts, something is added, and the straight line is built up 
to a toothed curve in the two-dimensional plane. The iteration rule reads: Remove the 
mean third of every straight partial piece and replace it by the sides of an equilateral 
triangle. The first steps of the iteration are shown in Fig. 26.4(b) (they remind one of 
the bulwarks of old fortresses). 

The peculiar feature of the Koch 9 curve and its related ones is that it is everywhere 
continuous but nowhere differentiable. One cannot give a tangent to the curve, because 
of the infinitely many sharp corners. The calculation of the length of the Koch curve 
also leads to a remarkable result: In every partial step, always 3 partial pieces are 
replaced by 4 of the same length. The total length L is therefore 


/4\ n 

L = lim L n = lim I - I = oo, 

n—*oo ti —>oo \ 3 ) 


(26.15) 


and thus diverges. This cannot be seen directly from the graph of the curve, due to the 
conversion to smaller and smaller length scales. Here the self-similarity becomes ap¬ 
parent which is a typical feature of fractals: On any length scale, a linear magnification 
of a detail of the object again resembles the entire object. 10 


8 This may be understood in a rather simple manner: Every point of the Cantor set may be charac¬ 
terized by an infinite sequence of “left-right decisions”; i.e., for every iteration step in Fig. 26.4 one 
must say in which of the two partial intervals the point lies. But this sequence may also be interpreted 
as the binary representation of a real number in the unit interval [0, 1], whereby the assertion becomes 
clear. 

9 Niels Fabian Helge von Koch , Swedish mathematician, b. January 25, 1870, Stockholm-d. March 
11, 1924, Stockholm. Koch was a scholar and successor of Mittag-Leffler at the University of Stock¬ 
holm. His main research fields were systems of linear equations of infinite dimension and with infi¬ 
nitely many unknowns. His name became familiar mainly from the curve named after him. 

10 Self-similarity alone is, however, not a sufficient criterion for fractals. For example, a straight line 
is self-similar in a trivial manner. 
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(3) The Sierpinski gasket: The basic element of the Sierpinski 11 gasket is a two- 
dimensional area, namely, an equilateral triangle. Iteration rule: Subdivide each trian¬ 
gle into 4 congruent parts and remove the central triangle. Figure 26.5 shows the first 
steps of this iteration. The resulting object is something between an area and a curve. 

It has again the property of self-similarity. 

A Fig. 26.5. Iterative construc¬ 
tion of the Sierpinski gasket. 
In each step, the middle of 
4 partial triangles is removed 

Nature offers a variety of fractal objects. Example from the organic world are the 
branchings of plants (trees, cauliflower, particularly beautiful ferns) or vessels. In¬ 
organic fractal shapes are observed in clouds, mountains, snowflakes, lightning dis¬ 
charges, etc. 

A classical example studied by Mandelbrot that resembles the Koch curve are 
coastlines. For the length of one and the same coast the geographers may give quite 
different values, depending on the length scale adopted in the measurement. The 
smaller the scale, the better is the scanning of the bays and windings of the coast, 
corresponding to the higher iterations L n of the Koch curve. Ultimately the coastline 
should wind about each individual grain of sand on the beach, which would blow up 
the length enormously. But here also shows up that the application of fractal geome¬ 
try to natural objects is meaningful only in a certain range. Ultimately on the atomic 
scale when the granularity of matter becomes apparent, the mathematical limit n -> oo 
loses its meaning. Nevertheless, the fractal scale behavior (see below) frequently can 
be traced over many orders of magnitude. 

The fractal dimension: An important tool for characterizing geometric objects is 
their dimension. For ordinary objects (smooth curves, areas, spheres, ...), the dimen¬ 
sion is clear and coincides with the visual conception. Fractals, however, behave dif¬ 
ferently. To characterize them, the concept of fractal dimension was invented. We will 
restrict ourselves mainly to the so-called capacity dimension (or box counting dimen¬ 
sion). 12 

The dimension shall be a measure of how dense a point set fills the space into which 
it is embedded. We start from the assumption that the space is a metric (normally 
Euclidean) space in which one can measure distances. This space is then, as illustrated 
in Fig. 26.6, covered by a grid of lateral length € (i.e., distances for ft = 1, boxes for 
ft = 2, cubes for ft = 3, hypercubes for n > 3) and one counts how many of the boxes 
contain one or several points of the point set under consideration. This number N(e) 



11 Waclaw Sierpinski, Polish mathematician, b. August 20, 1882, Warsaw-d. May 14, 1969, Warsaw. 
Sierpinski was a professor of mathematics in Lwow (now Ukraine, 1908-1914), Moscow (1915— 
1918), and later in Warsaw. His main fields of work were the theory of sets (here, in particular, the 
selection axiom and the continuum hypothesis), the topology of point sets, and number theory. 

12 The situation is complicated by the fact that a multitude of distinct mathematical concepts of di¬ 
mension are available. The calculated dimension values partly agree with each other but partly do 
not, depending on the considered object. For details, see K. Falconer, Fractal Geometry, Mathemati¬ 
cal Foundations and Applications, Wiley, New York (1990). 
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Fig. 26.6. The capacity di¬ 
mension of an object is deter¬ 
mined by counting how many 
boxes of a grid are touched 
by the object. The depen¬ 
dence of the number N(e) 
on the box size e determines 
the dimension Df. This is il¬ 
lustrated here for a “normal” 
two-dimensional object 




will in general depend on the box size e. For an improved resolution the object for sure 
will spread over more boxes, but the question is how fast that happens. If the scaling 
behavior in the limit e —> oo follows a power law 


N(e) = V(e) 6 Df with V ( 6 ) -> constant for e ^ 0, (26.16) 


then the value of the exponent defines the fractal dimension or capacity dimension Df. 
Solving yields 


„ XnN (e) — In V (e) In N(e) 

Df = lim-= lim -, 

€ —>-0 ln(l/e) €->o ln(l/€) 


(26.17) 


since V (e) remains finite in the limit and therefore does not contribute. 

For nonfractal geometric objects, the dimension determined in this way coincides 
with the normal Euclidean dimension, and V (0) corresponds to the Euclidean vol¬ 
ume. For example, for sufficiently high resolution a circular disk of radius R overlaps 
N(c) — 7tR 2 /e 2 boxes, and each further doubling of the resolution increases the num¬ 
ber of boxes by the factor 4. Hence, one finds for the circle N(c) ~2 ttR/c, and so 


For fractals, however, the power of the scaling law (26.16) differs from the naively 
expected value and is in general not an integer. For the Cantor set the determination 
of Df is particularly simple. Here it suffices to take the unit interval (n = 1 ) as the 
embedding space. It is most convenient to consider a sequence of subdivisions of the 
length €i that always differ by the factor 3; thus, eo = 1, e\ — 1/3, 62 = 1/9, etc. 
The Cantor set is constructed in such a way that then the number of “occupied” boxes 
(partial intervals) always doubles; thus, N(eo) = 1, N(e\) = 2, N(e 2 ) = 4, etc. Hence, 
the fractal dimension of the Cantor set is, according to (26.17), given by 


„ r lntf(e) r In N(c n ) 

Df = lim-= lim - 

€-*0 fil(l/e) n ^0 o ln(l/ 6 n ) 


In 2 n In 2 

: lim -=-~ 0.6309. 

n^oc\n3 n In 3 


(26.18) 


The result is independent of the manner of performing the passage to the limit 

€ —> 0 . 

Similarly, one finds the dimension of the Koch curve. To cover it, in the first step 
one needs N(e\) = 4 intervals of length c\ = 1/3, in the next one N(e 2 ) = 16 intervals 
of length €\ = 1/9, etc. As in (26.18), this implies 
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Finally, for the Sierpinski gasket we have 
In 3 

Df= -~ 1.5850, (26.20) 

In 2 

since here for each bisection of the box size the number of partial objects increases by 
the factor 3. 

The results (26.18) to (26.20) are not implausible. They quantify how the consid¬ 
ered fractals by their properties stand between the familiar objects point, line, area, 

_The comparison of (26.19) and (26.20) shows that the Sierpinski gasket is “more 

space-filling” than the Koch curve, but does not come up to a normal area. An ex¬ 
treme example in this respect is the area-covering curve that was discovered in 1890 
by G. Peano 13 and investigated in modified form by D. Hilbert. The Peano curve may 
also be obtained iteratively: In a subdivision of the scale length into three sections 
there arise nine partial distances of equal length; see Fig. 26.7. Accordingly, the di¬ 
mension is 

In 9 

Df= -=2. (26.21) 




Fig. 26.7. The first two iter¬ 
ation steps in the construc¬ 
tion of the area-filling curve 
of Peano. In the limit there re¬ 
sults a continuous mapping of 
the unit square onto the inter¬ 
val [0, 1]. To make the con¬ 
struction transparent, the rec¬ 
tangular edges were cut off in 
the plot 


This is the dimension n = 2 of the embedding space and hence the largest value that 
may be taken by the capacity dimension Df. N(e) takes the maximum value, since 
all boxes include parts of the object. The definition of the capacity dimension is very 
clear and has the advantage that it provides immediately an operative calculation rule. 
One only has to span grids and to count the boxes, which may easily be done on a 
computer. If the function A (6) in a doubly logarithmic representation yields a straight 
line (at least over a larger range of scale), then one can immediately read off Df from 
its slope. A related but more subtle definition of the dimension which refrains from 
equidistant grids and works with overlaps of variable magnitude was developed in 


13 Giuseppe Peano, Italian mathematician, b. August 27, 1858, Cuneo (province Piemont)-d. April 
20, 1932, Torino. Peano studied mathematics at the University of Torino, where he taught beginning 
in 1880 as a lecturer and beginning in 1890 as a professor. His early works concerned analysis, 
the initial-value problem of differential equations, and recursive functions. Peano emerged mainly 
as a founder of the mathematical logic (with G. Frege). The Peano axioms (1889) define the natural 
numbers via the properties of sets. His aim was the axiomatization of all of mathematics. Later, Peano 
moved beyond mathematics and developed an universal world language (Interlingua) that, however, 
did not gain acceptance. 
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1918 at Bonn University by the mathematician Felix Hausdorff. 14 We shall not en¬ 
ter into the details here and mention only that the Hausdorff dimension Du in the 
most cases coincides with Df , 15 but there also exist exceptional cases. Generally, 
Du < Df. 

Just for the classification of strange attractors, other dimension measures may also 
be meaningful. The Poincare mapping yields a possibly very inhomogeneously dis¬ 
tributed cloud of points in phase space. In the calculation of the capacity dimension 
the information on the frequency distribution of the points is ignored. There it has 
no meaning whether a box is occupied by a single point or by thousands. To take 
this quantity into account, one defines an information dimension , which includes the 
density distribution of the points. 

The construction is similar to that for the capacity dimension Df. The embedding 
space is again subdivided into boxes, but now the contribution of every cell is weighted 
by the probability pi of meeting points there. Practically, this value is determined by 
generating a very large number N of points and counts how many of them fall in the 
cell number i ; thus, pt = Ni/N with JT Ni = N. The weighting is performed with a 
logarithmic measure. The information dimension D\ is defined as 


D\ = lim 

6->0 


Hi Pi Ml/Pi) 

ln(l/e) 


(26.22) 


The factor /(/?) = pln(l/p) has the following meaning: Take an arbitrary point from 
the distribution and ask “Does the point lie in cell /?” The answer to this question 
yields the information set f(pt). This function vanishes for pi -> 0 and pi 1, since 
in these cases the answer is trivial (always “no” or always “yes”). The information gain 
is maximum for pi = 1/2; then the answer is in any case a surprise. 

The exact foundation for the function f(p) is provided by the statistical mechan¬ 
ics, or by information theory (Shannon’s information measure). 16 But at least we may 
easily see that (26.22) turns into the formula (26.17) if the point distribution is ho¬ 
mogeneous, i.e., if all probabilities in the in total N(e) covered cells have the same 
value pi = p = l/N(€). For the remaining cells pi = 0. Thereby, (26.22) reduces 
to 


D\ = lim 

€->0 


N(€) P m/ P ) 

ln(l/6) 


r In N(€) 

lim- 

€->oln(l/€) 


= Df. 


(26.23) 


For a less homogeneous point distribution, the informational content is lower and one 
can show that then D\ < Df V 


14 Felix Hausdorff\ German mathematician, b. November 8, 1869, Breslau-d. January 26, 1942, 
Bonn. Hausdorff studied and taught mathematics in Leipzig and beginning in 1910 in Bonn, until 
his forced retirement in 1935. He was persecuted because of his Jewish origin. In 1942, he and his 
wife committed suicide, shortly before deportation to the concentration camp. Hausdorff’s main re¬ 
search fields were topology and group theory. He introduced the concept of partly ordered sets and 
dealt with Cantor’s continuum hypothesis. He founded a theory of topological and metric spaces and 
about 1919 defined the concepts of dimension and measure named after him. 

15 J.D. Farmer, E. Ott and J.A. Yorke, Physica 7D, 153 (1983). 

16 C.E. Shannon and W. Weaver, The Mathematical Theory of Communication , Univ. of Illinois 
Press, Urbana (1949). 

17 See, e.g., H.-O. Peitgen, H. Jurgens and PH. Richter, op. cit. p. 735. 
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EXERCISE 


Exercise 26.1 


26.1 The Baker Transformation 


Problem. The process of stretching and folding, which is characteristic for the phase 
flow in chaotic systems, can be illustrated by a simple two-dimensional discrete map¬ 
ping. Let the phase space be the unit square [0, 1] x [0,1]. The motion of a point 
proceeds according to the transformation 


x n+ \ — 2 x n mod 1, 

0 <x n < j, (26-24) 

2 — — 1 > 


ayn, 

yn+1 = 11 for 

2 +ay n . 


with a parameter 0 < a < 1/2. Interpret this transformation, and calculate the fractal 
dimension of the set that is generated by application of (26.24) on the unit square. 





Fig. 26.8. The baker transfor¬ 
mation 


Solution. The effect of this transformation may be simply visualized geometrically. 
All lengths in v-direction are stretched by the factor 2. If a point thereby passes over 
the right boundary of the unit interval, then it is set back to the left by one unit 
length. This mapping is also called the Bernoulli shift; compare Example 27.2. Si¬ 
multaneously, all lengths in y-direction are compressed by the factor a. Points from 
the right half of the unit interval are additionally shifted by 1/2 in y-direction up¬ 
ward. This reminds us of the work of a baker: The pastry is rolled out to twice its 
length. Then the part sticking out is cut off and put on as a second layer onto the 
pastry. (If a < 1/2, there still is an “air cushion” of thickness 1/2 — a between the 
layers.) 

The volume change of a phase space domain A V under (26.24) is composed of the 
product of the stretching- and compression factor in v- and y-direction; thus, 


AV„+i = 2aAV n . (26.25) 

For a — 1/2, the volume is conserved. However, a connected domain in phase space 
is rapidly tom up by the repeated “back-folding” (here it is more a back-shifting) and 
distorted beyond recognition. The figure represents the first steps of the transformation 
of a circle of radius 1/2. 

For a < 1/2, the volume element shrinks in each iteration step and goes asymp¬ 
totically to zero. In the sense of Chap. 23, one also speaks of the dissipative baker 
transformation. The resulting geometrical object is a fractal which in y-direction dis¬ 
plays the structure of a Cantor set. Just as for the latter one may calculate the fractal 
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Exercise 26.1 

dimension according to (26.17). In the y-direction in the first transformation step the 
unit interval is converted into 2 parts of length a , in the next step 4 parts of length a 2 , 
and generally 2 n parts of length a n . If one selects a sequence of overlaps of the lateral 
length e n =a n , then N(e n ) = 2 n (l/c n ), where the second factor originates from the 
overlapping of the v-axis. The fractal dimension is therefore 

]nN(e n ) ln(2 n /e n ) nln(2/a) ln2-lna 

D\ = lim -= lim -= lim -=- 

n—^oo ln(l/6ft) n^oc ln(l/e n ) n^oo ln(l /a) —In a 

= 1 + 2 ^ 7 - (26.26) 
In a\ 
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In this chapter, we shall get to know various dynamical systems that may display 
highly complex forms of motion despite their very simple, even trivial structure. We 
shall meet previously discussed concepts such as bifurcations and periodic and strange 
attractors (limit cycles and chaotic trajectories) in a series of specific examples. For 
sake of simplicity, we shall begin with the investigation of systems with discrete dy¬ 
namics. The examples considered will gradually become physically more and more 
realistic. 


27.1 Dynamics of Discrete Systems 

So far, we have been interested in dynamical systems that are described by continuous 
differential equations with respect to time. In the Poincare mapping we have seen 
the possibility of reducing a continuous system to a time-discrete system. But since 
the Poincare mapping of a realistic physical system as a rule cannot be given in a 
closed form, it is instructive instead of these to consider simple mathematical model 
mappings. As it turns out, such models share many properties with systems which 
from a physical point of view are more interesting. 

Let us consider a sequence of vectors xo, xi, X 2 ,... which is generated by a simple 
iterative mapping , i.e., by repeated application of a continuous function f(x), 

x„+i=f(x„). (27.1) 

The asymptotic behavior of the sequence of the x n may be characterized as in Chap. 24 
by the appearance of fixed points or periodic limit cycles. The stability of a fixed 
point x p is again governed by the eigenvalues of the Jacobi matrix D = df/dx\ xP . 
We shall not go again into these details but still note an interesting relation between 
stationary and periodic solutions. Consider e.g. aperiodic solution alternating between 
two values: xi, X 2 , xi, X 2 ,.... Then obviously X 2 = f(xi) and xi = f(x 2 ), from which 
it follows that xi = f(f(xi)) = f 2 (xi) and also X 2 = f 2 (x 2 ). Here f 2 (x) must not be 
confused with the power (f(x)) 2 . This procedure may also of course be transferred to 

periodic solutions of length m, x\ , X 2 ,..., x m , xi, X 2 ,_Correspondingly, a periodic 

solution of the iteration function f reduces to a set of m constant fixed points of the m- 
times iterated function f m . For continuous (not time-discretized) solutions x(t), such 
a relation does not exist. 


W. Greiner, Classical Mechanics , 

DOI 10.1007/978-3-642-03434-3_27, © Springer-Verlag Berlin Heidelberg 2010 
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27.2 One-Dimensional Mappings 

For further simplification, we now concentrate on one-dimensional mappings x n +\ = 
f(x n ). The sequence of the x n can then be simply constructed graphically. The func¬ 
tion f{x) is plotted in a two-dimensional coordinate system with x n as the abscissa 
and x n +\ as the ordinate, together with the bisector of the angle x n +\ = x n . Start¬ 
ing with the first point x n = x\, the associated function value x n+ i = f(x\) = X 2 is 
marked. Since this value shall serve as the initial value for the next iteration step, it 
must be transferred to the v^-axis. This is done by drawing a horizontal line to the bi¬ 
sector. The intersection point x n = X 2 is the base for marking again the function value 
*n+ 1 = f(X2)=X 3 , etc. 

This geometrical construction is represented in Fig. 27.1, where the various possi¬ 
bilities of stability of a fixed point = f(x s ) of the mapping are illustrated simul¬ 
taneously. As is known, the derivative of the function at the fixed point, k = f'(x s ), 
governs the stability (eigenvalue of the Jacobi matrix). For \k\ < 1, the fixed point 
is stable (point attractor). The sequence of points approaches the fixed point either 
monotonically (for 0 < k < 1, (a)) or alternating (for —1 < k < 0, (b)). A value 
\k\ > 1, on the contrary, implies an unstable fixed point where the point sequence 
escapes, again either monotonically (for k > 1 , (c)) or alternating (for k < — 1 , 
(d)). In the limit k = — 1, there results a periodic sequence of length 2, while for 
k = +1 also each point in the vicinity of is a fixed point. In order to deter¬ 
mine the stability for \k\ = 1, the first derivative (linear approximation) is not suf¬ 
ficient. 

In Examples 27.1-27.3, various discrete one-dimensional mappings are presented 
and their dynamics are studied in detail. 


Fig. 27.1. Various kinds 
of fixed points of a one¬ 
dimensional mapping. 

The fixed point is stable 
((a) and (b)) or unstable 
((c) and (d)), depending on 
the slope of the function f(x) 
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EXAMPLE 

27.1 The Logistic Mapping 

One of the simplest but not least impressive examples of an iterative mapping is gen¬ 
erated by the logistic function. 1 This function is defined as an inverted parabola 

f(x)=ax( 1-x), x £ [0, 1], (27.2) 

with zeros at the border of the unit interval and a maximum at /(1/2) = a/A. The 
logistic function depends on a real parameter a which shall lie in the range 1 < a < 4, 
since otherwise the mapping either would lead beyond the unit interval [0, 1] (a < 0, 
a > 4) or would become trivial (0 < a < 1, all solutions converge toward x = 0). 

The mapping (27.2) has the two fixed points 

1 

x s i = 0 and x s 2 = l -, (27.3) 

a 

with the derivatives 

/'(x,.|) = a and f'(x s2 ) = 2-a. (27.4) 

Since a shall be > 1, the first fixed point is always unstable. The second one is stable 
(point attractor) if 1 < a, < 3. In this parameter range all solutions look like that repre¬ 
sented in Fig. 27.2 for a = 2.8; i.e., x n converges toward the fixed point x s 2 , for small 
starting value initially monotonically increasing, later on alternating. If a exceeds the 
value a\ = 3, then according to (27.4), the fixed point x s 2 becomes unstable, too. 

1 

0.8 
0.6 
0.4 
0.2 


(a) 

The geometric or numerical construction of the solution shows that a stable limit 
cycle of period 2 evolves, as is illustrated in Fig. 27.2(b) for the case a = 3.3. For 




(b) 


1 The logistic mapping may be interpreted with some effort as the model equation of a particular 
physical system; see the remark at the end of Example 27.3. It is also related to other fields of science. 
The logistic mapping was introduced first in 1845 in biological population dynamics by the Belgian 
biomathematician P.F. Verhulst. There it describes the evolution of a population of animals or plants 
in a restricted environment whereby each iteration corresponds to a new generation (the old one 
thereby dies off). For a low population density, the reproduction rate is positive (if a > 1). But, if one 
approaches the saturation density x = 1, the reproduction rate decreases by overpopulation such that 
x n+ \ < x n . The simple parabola ansatz is sufficient to generate complex dynamics. 


Fig. 27.2. Typical trajectories 
of the logistic mapping. For 
a = 2.8, the attractor is a fixed 
point, and for a = 3.3, a limit 
cycle of period 2 
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Example 27.1 


Fig. 27.3. The iterated logis¬ 
tic function /(/(*)) displays 
two maxima for a > 0 


a = a\, a period doubling of the solution arises. One faces a pitchfork bifurcation as 
is illustrated in Fig. 25.3. The old solution becomes unstable, and there appears a pair 
of new stable solution branches. (Note that here both branches simultaneously belong 
to the solution, since the latter one periodically jumps back and forth between the 
branches. This behavior differs from the case studied in Chap. 25, where both solution 
branches were independent of each other.) 

The period doubling can also be understood by inspecting the iterated mapping 
f 2 (x) = /(/(x)). As already mentioned, for this mapping a periodic solution reduces 
to two separated stationary fixed points. The iterated function 

f 2 (x) = a 2 x(l — x)(l — axax 2 ) (27.5) 

is a polynomial of fourth order with zeros at x = 0 and x = 1. Figure 27.3 shows this 
function for various values of the parameter a. For a < a\, the function / 2 intersects 
the bisector x just as / does only at the two points x s \ and x s 2 given in (27.3). As 
a polynomial of fourth order, (27.5) allows however four intersection points with a 
straight line, and just this happens for a > a\. As shown in Fig. 27.3, / 2 has a mini¬ 
mum at v = 1/2 for a > 2 and two maxima at x = 1/2 ± y/(l/4) — l/(2a). This can 
be seen without much calculation from 

fix) = f (fix)) { 21 . 6 ) 



A zero of the derivative (extreme value of f 2 ) follows from f'(x) = 0; thus x = 1/2. 
For the two other zeros one finds 

x = r l (\} since // ( /(x) ) = /, ( / ( / “ 1 (^))) = /, (^) =0 ’ (27 - 7) 

which as a quadratic equation yields the two roots mentioned above. For the slope of 
/ 2 (x) at the position of the fixed point x s 2 , one obtains 

f 2 'ix s2 ) = ia- 2) 2 . (27.8) 

For a <a\ =3, this slope is smaller than 1, and therefore x s 2 is a stable fixed point of 
/ 2 too. Of course this must be so because f 2 simply picks every second value of the 
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series of the x n , and thus it takes over the stability from the solution of the mapping /. 
For a = a\, f 2 {x) touches the bisector, and for a > a\ two new intersection points x \, 
X 2 arise. These are two stable fixed points of the iterated mapping / 2 , i.e., f 2 (x i) = 
x \, f 2 (x 2 ) = X 2 , which are related by X 2 = f(x 1 ) and x\ = f(x 2 ). 

If the parameter a increases further, these fixed points also become unstable at a 
critical value a = < 22 , and the game repeats for the mapping f 2 (x). A further bifurca¬ 
tion arises there with period doubling, and the new stable solution has a period length 
of A. Figure 27.4 shows this solution for the example a = 3.48. 




(a) x n (b) x n 


The critical value of 0 i 2 still can be given analytically, but with some effort. To 
this end, one first has to find the position of the fixed points x\ and X 2 . The condition 
f 2 {x) = v together with (27.5) leads to a quartic equation. But we already know two 
of the nodes, namely, the fixed points x s \ and x s 2 from (27.3), which may be factored 
out by polynomial division: 

f 2 (x) — x = x(x — 1 + l/a)(a 3 x 2 — a(l + a)x + a + l) = 0. (27.9) 

Setting the last bracket to zero yields a quadratic equation that determines the two new 
fixed points, namely, 


1 / 1 

*1,2 — ~z ( 1 H— 
2 V oi 



(27.10) 


The second bifurcation occurs when these fixed points become unstable. This is de¬ 
cided by the magnitude of the slope of the function f 2 (x) at the points x\,X 2 - The 
slope begins with the value +1 at a = ot\, and then decreases continuously to —1. At 
this point, the next bifurcation arises. One therefore has to solve the equation 

f 2 '(x 1,2,02) = -l. (27.11) 


Taking into account (27.6) and (27.10), equation (27.11) seems to depend on a in 
a very complicated manner. After some elementary transformations, (27.11) reduces 
however to a simple quadratic equation for both fixed points in common, 

— Tjo.2 — 5 = 0. (27.12) 

Hence, the critical bifurcation parameter for which the 2-cycle turns over to the 4-cycle 
is 


Example 27.1 


Fig. 27.4. (a) For a = 3.48, 
the iterated logistic function 
f {fix)) has one unstable and 
two stable fixed points at x ~ 
0.43298 and x ~ 0.85437. 
(b) The trajectory shows a pe¬ 
riod of length 4 


a 2 = l+V6 = 3.4495.... 


(27.13) 
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That both fixed points simultaneously become unstable is plausible and immediately 
follows from (27.6), since the slope of f 2 (x) has the same value at both points: 

f 2 '(x i) = /'(/(xi))/'(xi) = f'(x 2 )f'(f(x 2 )) = f 2 '(x 2 ). (27.14) 

It is not surprising that with increasing a the cycle of the period 4 becomes unstable 

too. There arises a full cascade of period doublings at a\, 012 , a?,, _In the interval 

oik < a < (*k+ 1 » there exists a stable limit cycle of period 2 k . Mathematically one may 
consider instead of the limit cycle the iterated function / 2 (x) which displays a set of 
2 k distinct stationary fixed points. 

One should note that the critical points oik are more and more closely spaced. As 
was found at first empirically and then analytically, the oik obey the law of a geometric 
sequence that converges toward a cluster point a oo: 

1 

oi k — oioo . (27.15) 

The number 8 is a constant that may be determined from the ratio 


8 = lim 


oik ~ oik-1 


k—>oo Olk-\- 1 — Oik 


For its value, one gets numerically 


(27.16) 


8 = 4.669201 


(27.17) 


and the accumulation point is at 

cxoo = 3.569944.... (27.18) 

The cascade of period doublings was considered first by GroBmann and Thomae . 2 
Feigenbaum 3 showed that the behavior (27.15) is not restricted to the logistic mapping 
but is universally valid for a large class of iterative mappings . 4 

Surprisingly, the numerical value (27.17) is also universally valid, and 8 is there¬ 
fore called the Feigenbaum constant. Essentially it suffices that the iterated function 
be smooth and display a quadratic maximum. The mathematical properties of the bi¬ 
furcation cascade have been thoroughly studied; for a survey see, e.g., H. Schuster, 
Deterministic Chaos , VCH Publishing Company, 1989. 


2 S. GroBmann and S. Thomae, Z. Naturforsch. 32a, 1353 (1977). 

3 Mitchell Feigenbaum , American physicist and mathematician, b. December 19, 1945, Philadelphia. 
Feigenbaum studied electrical engineering at the City College of New York and physics at MIT, 
where he did his doctorate in 1970 in high-energy physics. In 1974, he went to Los Alamos National 
Laboratory, where he was involved with problems of turbulence and nonlinear dynamics. In 1982, 
Feigenbaum was appointed as a professor at Cornell University, and presently, he heads the laboratory 
of mathematical physics at Rockefeller University in New York. Around 1976, he discovered, at first 
in numerical experiments on the logistic mapping, the universality of the period-doubling cascade 
and the constant that is named after him. In 1986, Feigenbaum (in common with Albert Libchaber, 
who experimentally demonstrated the period doubling in the flow of liquids) was awarded the Wolf 
Prize in physics. 

4 M.J. Feigenbaum, J. Stat. Phys. 19, 25 (1978). 
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Of particular interest is what happens beyond . For a = obviously a cycle 
of “infinite period” arises, i.e., an aperiodic, never-repeating solution, a > a oo is the 
domain of chaos that is characterized by irregular and seemingly random trajectories. 
As an example, Fig. 27.5 shows a fraction from such a chaotic solution calculated for 
a = 3.9. 



The attractor diagram of the logistic mapping: Figure 27.6 presents an attractor 
diagram of the logistic mapping that is highly instructive and of amazing complexity. 
For each value of a on the abscissa the values passed by a trajectory are plotted as 
a point cloud along the ordinate direction. The first iteration steps (here 500) were 
omitted in order to filter out “transient processes” (transients) and to represent the 
asymptotic attractor itself. The subsequent (200 each) iterations are then plotted in the 
diagram. 



Example 27.1 


Fig. 27.5. A chaotic trajectory 
of the logistic mapping for 
a = 3.9 


Fig. 27.6. The attractor dia¬ 
gram of the logistic mapping 
(Feigenbaum diagram) shows 
the iterated values x n versus 
the parameter a. One notes a 
cascade of period doublings, 
and above c^oo — 3.569944 
a chaotic domain that how¬ 
ever is infiltrated by win¬ 
dows of regular solutions. The 
marked rectangular domain is 
shown in linear magnification 
in Fig. 27.7 
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Example 27.1 

The left part of Fig. 27.6 shows the first segments of the bifurcation cascade: At a\ , 
« 2 , « 3 , the attractor splits in 2, 4, and 8 branches, respectively. According to (27.15), 
the further critical points ak follow each other so closely that they are no longer re¬ 
solved in the figure. Beyond one sees continuous bands more or less uniformly 

grey. The grey domains are passed through by some kind of “scars.” Obviously these 
places are more frequently passed by the trajectory, and thus, the probability of finding 
P(x) of the attractor is increased here. 

One notes that in the chaotic domain a > a oo, the orbits are confined to a partial 
interval of [0, 1]. The borders of this interval are obtained as x max (aO = /(I/2) = a/4 
and v m i n (aO = / 2 ( 1/2) = (1/16 )<x 2 (4 — a). In the limit a = 4, the full unit interval is 
passed. Thereby, the attractor covers the interval completely. More precisely, for any 
point v e [0,1] and any e > 0, one can find an n(e) such that \x n — x\ < e. Thus, the 
trajectory approaches each point arbitrarily closely if one waits for a sufficiently long 
time. 

If the parameter is reduced from a = 4, there arise various interesting phenomena. 
For example, when going below a' x ~ 3.6785 a band splitting is observed. The chaotic 
attractor which formerly covered a connected interval splits at a = a[ into two disjunct 
domains, as is clearly seen in Fig. 27.6. The trajectory thereby alternates back and 
forth between the two “partial bands.” For a further reduction of a , the partial bands 
in turn split again, into 4 parts at a' 2 ~ 3.5926, and so on. Similar to the bifurcation 
cascade a\, a 2 , ..., cioo of the regular attractor, there also exists a kind of reversed 
bifurcation cascade a' 2 , ..., a'^ of the chaotic attractor. Both cascades meet at a 

common limit . 

A closer inspection of the attractor diagram Fig. 27.6 reveals that there are also 
windows of periodic solutions embedded in the chaotic domain. Particularly striking 
is the domain with solutions of period 3 which occur above a ~ 3.8283. This is con¬ 
nected with a fixed point of the triply iterated mapping that arises at the position of 
the maximum of /(x), i.e., at x = 1 / 2 : 

/3 G)4 < 27i » 

One easily confirms that this happens at 043 = 1 + \/8 ~ 3.8284. The point x = 1/2 
therefore has a particular meaning, since here the derivative vanishes; /'(1/2) = 0 . 
Because 

^/ 3 (x) = /'(/ 2 W)/'(/«)/'«, (27.20) 

this property transfers also to all iterated mappings. This guarantees that the fixed point 
(27.19) is stable. Here, one even speaks of a superstable cycle , since the magnitude 
of the derivative of f n (x) which determines the stability takes the smallest possible 
value, namely zero. Due to the continuity of the mapping as a function of the parame¬ 
ter, the period-3 cycle is still stable in a finite environment of 043 . In the downward 
direction the window of stability borders on the chaotic region. 

If a increases beyond 043 , one again finds a cascade of bifurcations with period 
doubling, i.e., the 3-cycle turns into a 6 -cycle, etc. The bifurcations follow each other 
more and more closely until chaotic solutions emerge again. 

This consideration is not restricted to the cycle of period 3 but holds also for arbi¬ 
trarily larger period lengths. There exist superstable cycles for any natural number m 
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(their number even increases exponentially with m) with period length m which are 
defined by 


/ 


m 




(27.21) 


They are always enclosed by a small window of regular cyclic orbits in the otherwise 
chaotic domain. 

It is highly instructive to study the attractor diagram in detail. Figure 27.7 shows a 
small section a = 3.52 to a = 3.65 at the border of the 3-cycle window. The agree¬ 
ment with the full attractor diagram Fig. 27.6 is amazing. Although there are tiny dif¬ 
ferences in the shape of both geometrical objects, their structure is nevertheless very 
similar. This property—a partial section looks just as the entire diagram—is called 
self-similarity (in the parameter space). For the attractor diagram of the logistic map¬ 
ping, the process of successive magnification of a section may be continued ad infini¬ 
tum: In Fig. 27.7, one again finds partial domains that resemble the entire figure and 
so on, without an end. 


Example 27.1 



a 


Fig. 27.7. A small section of 
the attractor diagram of the lo¬ 
gistic mapping shows quite a 
similar structure to the full di¬ 
agram in Fig. 27.6 


The Lyapunov exponent of the logistic mapping: In the nonchaotic range, the 
Lyapunov exponent may be calculated rather simply analytically. We first consider 
the parameter range a <a\ =3. Here, a stable fixed point exists at x S 2 = 1 — 1 /or; see 
(27.3). The quantity to be calculated is 


l n ~ L 

cr= lim - Vln |/'(*/) |. (27.22) 

n—>oo n ' 

1=0 

The influence of transients is excluded by the limit process. Because jc/ -> x s 2 , only 
the fixed point contributes, and thus the sum reduces to a single term: 
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cr = ln\f'(x s2 )\ = In \ot(l -2x s2 )\ =ln|a(l -2 + 2/a)| 

= In 12 — of |. (27.23) 

The result shows that the Lyapunov exponent diverges logarithmically at a -> 2, 
a -> — oo. This of course happens just there where the derivative of the function / 
at the fixed point vanishes, which corresponds to a particularly fast approach to the 
attractor. This situation is called superstable. When approaching the parameter value 
cy —> ofi = 3 , the Lyapunov exponent vanishes. This is a general feature of bifurca¬ 
tion points, since here the old attractor loses its stability, and the new attractor is not 
yet born. Beyond oq, the period-2 cycle becomes an attractor, and therefore, a again 
decreases to negative values. 

It is not difficult to calculate the Lyapunov exponent in the interval ol\ < a < a 2 . 
Since now a periodic attractor exists that alternates between the points x\ and x 2 
from (27.10), (27.22) reduces to a sum of two terms: 

a = Lnl/Wl + Ln|/'(x 2 )| = I In \f'(x l )f(x 2 )\. (27.24) 

For the derivatives, we get 


fix 1 , 2 ) = 0(1 - 2 X 1 , 2 ) = —1 T 7(a + l)(o-3), (27.25) 

and the Lyapunov exponent is 

cr = i In 1(1 — (a + l)(a -3)1. (27.26) 

This function begins with the value o = 0 at a = aq, then decreases monotonically 
and diverges at a = 1 + y/5. This is again the superstable point of the 2-cycle, since 
one easily confirms that 

f2 (^) = \ at « = 1+V5. (27.27) 

o then increases again and finally reaches the value o = 0. This happens when the 
argument of the logarithm in (27.27) takes the value — 1, and thus, 1 — (a +1) (a — 3) = 
— 1, which leads to the quadratic equation (27.12). The solution a = a 2 = 1 + V 6 is 
the bifurcation point at which the 2-cycle becomes unstable. In the further course 
of the bifurcation cascade a 2 < a < a^, the game is repeating, and the Lyapunov 
exponent oscillates in the interval 0 > a > — oo. 

A qualitatively new feature arises in the chaotic range a > since here a takes 
positive values. This is represented in Fig. 27.8, for which the Lyapunov exponent was 
determined numerically. To the left of or = c^oo, one faces the range with negative o 
that has been discussed already. For a > a oo, cr becomes positive and on the aver¬ 
age increases with increasing a. In the limit a = 4, there results the maximum value 
a = In2 ~ 0.6931, as will be shown in Exercise 27.2. The chaotic domain is repeat¬ 
edly interspersed with windows corresponding to regular solutions where a becomes 
negative. The figure reflects the complexity of the function cr (a) only imperfectly. The 
windows may be perceived only approximately, due to the limited resolution. The in¬ 
dicated peaks pointing down in the figure are actually poles extending to a — oo. 
A realistic representation of cr(a), when plotted with finite line width, would display 
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Fig. 27.8. Numerically calcu¬ 
lated values of the Lyapunov 
exponent o of the logistic 
mapping. For o > 0, chaotic 
trajectories arise 


only a largely homogeneous black block that extends to — oo. There are infinitely 
many windows with stable cycles, and one may even show that they lie densely over 
the entire real interval 0 < a <4: Any arbitrarily small vicinity of each point a still 
includes stable cycles! 

The function a (a) also has the property of self-similarity, just like the attractor 
in Fig. 27.6. For each step of magnification, the partial sections look like the full 
figure. 


EXERCISE 


27.2 Logistic Mapping and the Bernoulli Shift 


Problem. The mapping 


y n +\ =2 y n (mod 1) with y n e [0, 1] (27.28) 

is called the saw-tooth mapping, or the Bernoulli shift. 

(a) Discuss the trajectories y n generated by the Bernoulli shift as a function of the 
start value yo- 

(b) Show that for the parameter a = 4, i.e., in the region of the fully developed chaos, 
the logistic mapping and the Bernoulli shift are equivalent. 

Hint: Use the transformation of variables 

x n = i(l - cos(2ny n )) = sin 2 (ny n ). (27.29) 

(c) Find the frequency distribution P(x) of a “typical” trajectory x n of the logistic 
mapping for passing the various points in the interval 0 < x < 1, and calculate the 
Lyapunov exponent a from this distribution. 
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Fig. 27.9. The mapping func¬ 
tion of the Bernoulli shift 


Solution, (a) The graph of the function /(y) = 2 y (mod 1) consists of two pieces of 
a straight line with the slope 2 that are shifted relative to each other, as is represented 
in Fig. 27.9. Since everywhere /'(y) = 2 > 1, stable fixed points cannot exist. The 
iterated solution of (27.28) is simple, 

y„=2"y 0 (modl). (27.30) 

Hence, all iterated solutions of an initial value yo are explicitly known, nevertheless 
the mapping is chaotic! This is due to the factor 2", which implies an exponential 
enhancement of smallest deviations in the initial value yo. As is represented in the 
figure, an interval Ay is stretched by the Bernoulli shift by a factor 2 to the length 
2Ay. Values falling into the range y > 1 are folded back to the unit interval by the 
modulo operation. This repeated sequence of stretching and folding is characteristic 
of chaotic mappings. In this way there results a thorough mixing of the trajectories 
and a sensitive dependence on the initial conditions. 

The Bernoulli shift mapping may be visualized by writing y as a number in binary 
representation : 


00 

y = Y J h 2~ k i.e., y = O.bi b 2 b 3 b 4 ■ ■. with fc* = Oorl. (27.31) 

k= 1 

A doubling of y corresponds to a left shift (therefore the name) of the binary digits bk , 
and the modulo operation ensures that the digit before the decimal point is cut off: 


f(y) = b\.b 2 b'$b/[... (modi) 
= 0. Z?2 ^3 b4 bs _ 


(27.32) 


Now the effect of the Bernoulli shift becomes clear: Each iteration enforces the “back 
digits” in the binary expansion. If the initial value yo is known to an accuracy of 
2~ m , this information is exhausted after m iteration steps. Later on there remains only 
“numerical noise,” the trajectory wanders around through the unit interval in a non- 
predictable way. 

Mathematically, there still remains a subtle difference between rational and irra¬ 
tional values of the initial condition. A rational number (a fraction p/q) yo has a 
binary representation which (after a finite number of steps) becomes periodic. Con¬ 
sequently the trajectory y n formed according to (27.28) with (27.32) also becomes 
periodic. A simple example is provided by the cycle 2/3,1/3,2/3,..., since 


yo — 0.101010... — - ^l + ^ + ^H- 

yi = 0 .oioioi ... = i( 1 + 2 + l + ... 

y 2 = 0.1 0 1 0 10 ..., 


1 

2 

1 

4 


1 - 1/4 


1 - 1/4 


2 

3’ 

1 

3’ 


The rational numbers lie densely on the real axis. Thus, there are infinitely many start 
solutions leading to periodic trajectories, and in any arbitrarily small vicinity 6 of a 
point y one finds such solutions. On the other hand, the set of rational numbers has 
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the measure zero ; these are therefore “atypical” numbers. If one adopts a “typical” 
initial value yo, e.g., a random number in the interval [0,1], then the probability for 
yo being rational and thus leading to a periodic trajectory is arbitrarily small. For a 
physicist, rational initial conditions do not play a role because of the finite precision of 
measurement. In the numerical simulation, the situation is different: If the computer 
used stores the numbers with a precision to m bits, then after at most m steps the 
simulation of the Bernoulli shift becomes meaningless. Fortunately for many purposes 
it makes no difference whether one is dealing with a cyclic solution with a very long 
period or with a genuine nonperiodic solution. 

(b) The logistic equation 


X n +1 =4x„ (1 -X„) 


is transformed to the new variable y n . The right-hand side then becomes 


d(i - COS(27T V„)j ^1 


4^(1- cos(2 ny n ) 11 1 - \ + \ cos ( 2jr y„) 

1 


= 1 - cos 2 (2 7ty n ) - 1 - -(l + cos(4^y„)) 


= 1(1 -cos(4jry„)), 
and hence, (27.33) reads 

1(1 - cos(2^y n+ i)) = 1(1 - cos(4jr;y„)) 


or 


cos(27ry^ + i) = cos(47ry n ). 
This solution is solved by 
y n + 1 = 2 y n (mod 1), 


(27.33) 


(27.34) 


(27.35) 

(27.36) 

(27.37) 


i.e., the Bernoulli shift mapping. When transformed back to the variable x, the solution 
(27.37) reads 

x n = -(l — cos(27r2 n yo)) = sin 2 (7r2 n xo) 

= sin 2 (2 n arcsin Vx). (27.38) 


As was discussed in (a), this leads for almost all initial values xo to chaotic trajectories 
which cannot be calculated even numerically if n becomes large. 


(c) In the range of definition of the Bernoulli shift mapping (27.28), no point is 
particularly distinguished. For typical, i.e., irrationally chosen initial conditions, the 
solution (27.30) will meet all numbers in the interval (0, 1) with the same probability, 
hence P (y) = 1. This implies a corresponding probability of the logistic mapping of 


dy 

P(x) = 2P(y)-f=2 
ax 


1 1 

2n sin(7ry) cos(7ry) 


1 

7ty/x(l ~ X) 


(27.39) 


Exercise 27.2 
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The factor 2 arises because the transformation equation has two solutions y symmet¬ 
rical about y = 1/2 for any value of x. The probability of finding P(x ) is minimum 
at v = 1/2 and increases toward the borders of the unit interval. At x 0 and jc —> 1, 
the function diverges but remains integrable. It is normalized to unity. 

For the calculation of the Lyapunov exponent (26.8), we replace the mean value 
over the time sequence by a mean value over the probability distribution P(x): 



(27.40) 


Systems for which this substitution time average phase-space average is permis¬ 
sible are called ergodic. The proof of ergodicity of a system is in general not simple. 
Since f\x) = 2 for all x and since the probability distribution P(x) is normalized to 
unity, the Lyapunov exponent of the logistic mapping at a = 4 is obtained as 


or =ln2 = 0.6931..., (27.41) 

which agrees with the numerical result from Fig. 27.8. 


EXAMPLE 


27.3 The Periodically Kicked Rotator 

In this section, we shall meet another example of a discrete mapping that, despite of 
its simple shape, leads to complex solutions and to chaotic behavior. Thereby, several 
new concepts arise, and a way is described that leads from a quasiperiodic to a chaotic 
motion. 

In contrast to the logistic mapping, which was introduced as a purely mathematical 
example, we now consider the motion of a specific mechanical system, namely, a 
damped rotator that is under the influence of an external force. The corresponding 
equation of motion for the rotational angle 6 reads 

0 + pd = M(P,t). (27.42) 

Here, ft is a friction parameter, and M(6,t ) describes the imposed time-dependent 
torque divided by the moment of inertia of the rotator. The external force shall depend 
periodically on the time. The problem simplifies if the force is acting in short pulse¬ 
like impacts spaced in time by the period T : 

00 

M(6,t) = M(6)y2 s (t-nT). (27.43) 

n =0 

The nonautonomous differential equation of second order 

00 

6 + pd- M(6) y^S(t -nT) = 0 

n=0 


(27.44) 
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can be rewritten as usual in an autonomous system of three coupled differential equa¬ 
tions of first order. With v = 0, y = 6, and z = t, we have 

* = y> 

00 

y = -pd + M(0)J2 8 (z- n T), (27.45) 

n =0 

Z= 1. 

Because of the specific form of the force (27.43), the rotator is again and again accel¬ 
erated by the pulse, but otherwise moves freely influenced only by friction. Therefore, 
the equations of motion can be integrated exactly. For this purpose, we introduce the 
discretized variables 

x n = lim x(nT — e), y n = lim y(nT — e). (27.46) 

6 —>0 6 —>0 

Thus, position and velocity are scanned shortly before the individual kicks. We now 
consider the momentum number n and integrate the equation of motion over the nth 
time interval nT — e < t < (n 1 )T — e. Since only one kick contributes in this 


interval, the equation of motion for y reads 

y = -py + M(x)S(t - nT). (27.47) 

The solution of the homogeneous differential equation holding between the kicks may 
be given immediately: 

y(t)=ae~P t for t^nT. (27.48) 

The impact of force causes a sudden increase of velocity according to 

pnT +€ 

yinT + e) — y(nT — e) — I dt(-0y + M(x) S(t - nT)) (27.49) 

JnT—e 

= -2ePy(nT) + M(x(nT)) 

— M(x(nT)), (27.50) 

as is schematically represented in Fig. 27.10. With (27.48) and (27.49), one obtains 

y(t) = (y n + M(x n ))e~ fi{t - nT) for nT < t < (n + 1)7\ (27.51) 

The angle coordinate x(t) results from this by integration: 

x(t)-x n + [ dt'y(t') =x n - l( e -/ 3(f - n7 ’ ) - l)( 3 ?„ + M(*„))• (27.52) 

J nT P 

Equations (27.51) and (27.52) taken at t = (n + 1 )T — e thus yield a two-dimensional 
discrete mapping 

X„+ 1 =Xn + 1(1 - e~P T )(y n + M(x„)) (mod2^), (27.53) 

Vn+l = e-f >T (y n + M(x n )). (27.54) 


Example 27.3 



Fig. 27.10. Periodic impacts 
of force (lower part) cause sud¬ 
den changes of velocity ( up¬ 
per part). The influence of fric¬ 
tion between the impacts is 
perceptible 
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Fig. 27.11. The force F(0 al¬ 
ways points in the same direc¬ 
tion, independent of the dis¬ 
placement angle 0 of the rota¬ 
tor 


By performing the modulo operation, one takes into account that v is a periodic angu¬ 
lar coordinate. The system (27.53) describes a Poincare mapping of the periodically 
kicked rotator. 

The angular dependence of the torque M(x) is determined by the specific physical 
system. If a body is moving on a circular orbit and is under a force pointing always 
in the same direction (see Fig. 27.11), then the torque is proportional to the sine of 
the angle, M = r x F = — r F sin# e z . In addition, we take into account an angle- 
independent torque Mo, i.e., with Ko = r F\ 

M(x) = M 0 + K 0 sinv. (27.55) 

The system (27.53) and (27.54) with the nonlinear force law (27.55) is called the dis¬ 
sipative circular mapping (“dissipative” since we are dealing with the limit of strong 
damping) and displays very interesting dynamic properties. 

The equations may be written still more clearly by introducing the following ab¬ 
breviations: 



(27.56) 


r n represents a rescaled velocity coordinate. Insertion into (27.53) leads to the follow¬ 
ing form of the dissipative circular mapping: 


x n +\ = x n + br n + Q — K sinx n (mod 2n), 


(27.57) 

(27.58) 


y n +i = e P T (y„ + M(x n )). 


One can iterate this equation numerically and study the solutions for various values 
of the parameters b, K, and Q. A further simplification results in the limit of strong 
damping , i.e., fT ^> \ ox b <^\. (To overcome the friction, Kq must, according to 
(27.56), increase linearly with c.) In this case, the velocity y n is decelerated to zero 
immediately after each “kick.” Consequently, the equation for the angle decouples to 

Xn +1 = / (Xn) (mod 27T) 

= x n + Q — K sinv n (mod 27r). (27.59) 

This equation is called the one-dimensional circular mapping or the standard map¬ 
ping. Its mathematical properties were intensely studied, in particular by the Russian 
mathematician V.I. Arnold. 5 Its interesting properties are due to the nonlinearity of 
the sine function. 

Let us consider for a moment the trivial limit K = 0, i.e., the linear circular map¬ 
ping 

x n+ \ = x n + Q (mod 2ir). (27.60) 


V.I. Arnold, Trans, of the Am. Math. Soc. 42, 213 (1965). 


Example 27.3 



Fig. 27.11. The force F(t) al¬ 
ways points in the same direc¬ 
tion, independent of the dis¬ 
placement angle 0 of the rota¬ 
tor 


5 
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The rotator is moving forward in equidistant steps £2. If it reaches the old position 
again after a finite number of steps, the motion is periodic. This obviously happens if 
£2/2 tt is a rational number , 


Q = 2tt — , 

q 


p and q coprime numbers. 


(27.61) 


This means that after q time steps the rotator has performed p full turns, i.e., takes 
again (modulo 27 t) its original position. One deals with a solution with the period q. 
If however £2/27r is an irrational number , the initial point xo is not reached again even 
after an arbitrarily long time. But there occur values x n in any arbitrarily small vicinity 
of xo. In such a case the motion is called quasiperiodic. 

In order to characterize the motion, one may define a winding number. 


1 

W = — lim 

2jt n^oo 


/ W (x 0 ) -XQ 

n 


(27.62) 


Here, f n (x o) is the ft-fold iterated mapping function (without performing the modulo 
operation) from (27.59). The winding number thus represents the mean shift per stroke 
interval. W = W(Q, K) depends on both parameters of the circular mapping. In the 
linear case, K = 0, W just coincides with the fraction p/q defined in (27.61). Since 
the rational numbers, although densely located on the real axis, form only a set of 
measure zero, the “typical” trajectories are quasiperiodic. 

What happens now if the nonlinearity becomes efficient (i.e., for K ^ 0) in the 
circular mapping (27.60)? Let us consider a periodic solution with a rational winding 
number W = p/q. The angular coordinate passes a cycle xi, X 2 ,..., x q of length q 
that causes a final shift of x q = x\ + 2np mod 2n =x\ \ hence, 


f q {x \)=X\ +2 np. 


(27.63) 


At the beginning of the present chapter, we met the criterion for the stability of a 
discrete mapping, in the case at hand, of the g-fold iterated function f q (x): The mag¬ 
nitude of the derivative of the function must be smaller than unity, and thus, 


l/n*i)l 


d 

dx i 


/(/(•• •/(■*!))) 


q 

n f\xi) 

;=i 


q 

j~~[(l — K cosx/) 

i—\ 


< 1. 


(27.64) 


If this condition is fulfilled, then x\ (and since all points in the cycle are on equal 
base, all other x; too) belongs to a stable periodic attractor. In the linear case, K = 0, 
the derivative has always the value f q '(x) = 1. There exists marginal stability where 
neighboring orbits are neither attracted nor repelled. If however 0 < K < 1, then each 
of the solutions Q p , q —2ix p/q becomes a periodic attractor with a domain of at¬ 
traction of finite width AQ Ptq . This is an example for a very interesting phenomenon 
that arises in many branches of physics. Vibrating systems which are characterized 
by two distinct frequencies adjust themselves—provided that there is a correspond¬ 
ing interaction—in such a manner that the frequencies are synchronized , i.e., are in 
an integral ratio to each other. The phenomenon is also called mode locking. The two 
frequencies of the system considered here are determined on one hand by the stroke 
length T, and on the other hand by the magnitude of the torque Mq. 
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Example 27.3 Possibly the earliest experimental evidence for the phenomenon of mode locking 

is ascribed to the Dutch physicist Christian Huygens. He observed that a series of 
pendulum watches (Huygens played a decisive role in their discovery) which were 
suspended in a row began to vibrate in the same rhythm, although their limited accu¬ 
racy of movement would rather have suggested a drifting apart. Huygens recognized 
that the weak coupling of the watches via their common back wall was responsible for 
the synchronization. 

The extension of the mode-locking ranges can be calculated from (27.63). For 
larger period lengths q, this may be performed only numerically. We therefore re¬ 
strict ourselves here to the simplest case q = 1. If there is a complete synchronization 
with winding number W = 1, then (27.63) becomes 

f{x\) = x\ + 27T or Q = K sinvi + 27r. (27.65) 

The stability condition (27.64) reads 

|1 -K cos vi | < 1, (27.66) 


which is fulfilled for angles 0 < x\ < n/2 or 3n/2 < x\ <2n. The associated values 
of Q may be read off from (27.65): 


2 7t -K < <2n + K. (27.67) 

The range of mode locking with just one turn of the rotator per stroke interval thus 
has the shape of a triangle that opens with increasing K . Just the same consideration 
leads to the attraction domain of the attractor with winding number W = 0, and thus, 
p = 0, q = 1: 

—K < Q 0 ,l < K. (27.68) 

Because of the periodicity of the angle coordinate, it suffices to investigate the inter¬ 
val 0 < Q < 27r. Values beyond this range mean only that the rotator for each kick 
performs additional full turns, which does not change the dynamics significantly. In 
Fig. 27.12, the borders of the ranges (27.67) and (27.68) are drawn as straight lines. 
The parameter values for which a periodic synchronized motion arises are shaded in 
the diagram. 


Fig. 27.12. Several domains 
of stability for the one¬ 
dimensional circular mapping 
depending on the parameters 
£2 and K . The hatched regions 
are called “Arnold tongues.” 
On the lines in the upper half 
the condition of superstability 
is fulfilled 



Analogous considerations may be made for any rational winding number W = 
p/q. Some of the stability ranges (there are, of course, infinitely many ones) are 
represented in Fig. 27.12. The width of these ranges, which are also called Arnold 
tongues , increases monotonically with K. In the linear limit (K = 0), the summed-up 
total width of all Arnold tongues is equal to zero (measure of the rational numbers), 
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as was already mentioned. It can be shown 6 that for K = 1 the Arnold tongues cover 
the entire Q -range: 

y]A£2 p , 9 =27r for K = 1. (27.69) 

PA 

Hence, the situation is exactly complementary to the case K = 0; the “typical” so¬ 
lutions are now periodic (mode locking), while the quasiperiodic solutions have the 
measure zero. 

The function W(Q), i.e., the winding number as a function of the frequency para¬ 
meter Q at K = 1, is called the devil's staircase (see Fig. 27.13). It is a function that 
is everywhere continuous but nowhere differentiable. To any rational number p/q 
belongs a step; the width of the steps decreases with increasing period length q\ its 
total width according to (27.69) covers the entire interval. The devil’s staircase has the 
property of self-similarity, i.e., any sectional magnification again resembles the entire 
object. 



If the parameter of the nonlinear coupling exceeds the value K = 1, then the Arnold 
tongues coalesce. This is connected with a conversion of the quasiperiodic solutions 
into chaotic ones, as may be seen from the occurrence of positive Lyapunov exponents. 
At the same time, for K > 1 there still exist domains of periodic solutions with a 
negative Lyapunov exponent. Both types of solutions are interwoven in a complex 
manner. For the logistic mapping, we observed a mixing of regular and chaotic motion. 
But now the relations are even more complicated, since now two parameters, £2 and 
K, may be varied independently. An interesting feature of the chaotic solutions is that 
they don’t have a well-defined winding number. The solution moves so irregularly that 
the limit in (27.62) does not exist. 

In the upper half of Fig. 27.12, for K > 1, the centers of several periodic ranges 
are plotted as lines. Here, the condition of super stability introduced on page 524 is 
fulfilled, i.e., the derivative of the iterated mapping function of a q -cycle vanishes, 
f q \x) = 0. For the period q — 1, this condition may be evaluated easily. The solution 
with winding number W = 0 has as its fixed-point condition 

f(xx)=x\\ thus, £2 — K sinvi = 0. (27.70) 


M.H. Jensen, P. Bak and T. Bohr, Phys. Rev. A30, 1960 (1984). 
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Fig. 27.13. The steps of the 
“devil’s staircase” are ranges 
where the winding number 
“clicks into place,” i.e., is in¬ 
dependent of the frequency 
parameter f2. The sectional 
magnification on the right in¬ 
dicates the self-similar struc¬ 
ture of the devil’s staircase 


6 
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This 1-cycle is superstable if (see also (27.59)) 

/'(xi) = 0; thus, l — K cosxi = 0. (27.71) 

With the value xi following from (27.70), equation (27.71) yields the condition for 
superstability 

K = v/l + f2 2 (27.72) 


as is plotted in Fig. 27.12. The condition for a superstable fixed point with the winding 
number W = 1 follows analogously as 

K = Vl +(2 tt - ft) 2 . (27.73) 


The crossing of the curves indicates that for equal values of Q and K distinct stable 
solutions may coexist. In the range K > 1, there is no longer a straightforward relation 
between the parameters K, El, and the winding number W. Which of the solutions is 
realized then depends on the initial condition for v. 

The occurrence of chaotic solutions is associated with a qualitative change of the 
mapping function f(x). As Fig. 27.14 shows, for K < 1, f(x ) is a monotonically 
increasing function. For K > 1, the nonlinear coupling is so strong that f(x ) reflects 
the shape of the sine function; i.e., there arise (quadratic) maxima and minima. Similar 
to the previously treated logistic mapping, f{x) is not invertible for K > 1. This is a 
necessary (but not sufficient) condition for the occurrence of chaos in one-dimensional 
mappings. 


Fig. 27.14. (a) The mapping 
function of the circular func¬ 
tion for K < 1 increases 
monotonically. (b) For K > 
1, a maximum develops, and 
f(x) is no longer invertible. 
The function f(x) = x + 
El (mod 2tt) is drawn as a 
dashed line 


K< 1 



K> 1 



The complex behavior in the mode locking expressed by the devil’s staircase is 
well confirmed by experiment. Even simpler than mechanical oscillators are nonlinear 
electric circuits. For example, mode locking was investigated in an externally period¬ 
ically driven circuit involving a superconducting Josephson junction and an induction 
which may be described mathematically by the circular mapping. 7 

Supplement: It should be noted that the logistic mapping from Example 27.1 may 
be interpreted as the motion of a periodically kicked rotator. For this purpose, the angu¬ 
lar dependence of the torque is not represented by (27.55) but rather by the (somewhat 


M. Bauer, U. Kruger and W. Martienssen, Europhys. Lett. 9, 191 (1989). 


7 
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artificially constructed) function 


M(x) = K 0 




(27.74) 


If we consider the dissipative Poincare mapping (27.53) in the limit of strong damping, 
f3T 1, the equation for x n again decouples, 


■* 71+1 


, K 0 (, 

■x n + —\ior- 


1>X ■ - 


(mod 27 t). 


The choice Kq = /3 leads to 


■* 71+1 — ax n 



(mod 27r), 


(27.75) 


(27.76) 


which after rescaling of the angular variable to the unit interval by x' n — x n l2n yields 
the logistic mapping 
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x' n+l = ax' n (l - x' n ) (mod 2 tt). (27.77) 

For parameter values a < 4, the values of x n are automatically bound to the unit inter¬ 
val, and hence the modulo operation may be omitted. 


EXAMPLE 


27.4 The Periodically Driven Pendulum 

In the preceding examples, we have studied systems the dynamics of which could be 
described by the iteration of simple, analytically known discrete mappings. The logis¬ 
tic mapping from Example 27.1 with its extremely simple structure served as a “test 
laboratory” for investigating many aspects of nonlinear dynamics but has no plausi¬ 
ble physical analog. For the “periodically kicked” damped rotator of Example 27.3, 
the dynamics could also be reduced to the iteration of discrete equations of motion (in 
the limit of strong damping to the one-dimensional circular mapping (27.57)), because 
of the pulse-like nature of the acting force. Another model example, possibly even 
more realistic and appropriate for a clear illustration of the characteristic phenomena 
of nonlinear dynamics, is the periodically driven pendulum . 8 

Let the pendulum, a nonlinear oscillator with a backdriving force proportional to 
the sine of the displacement angle 0 , be under the influence of an additional external 
force with a harmonic time dependence. Moreover, let the system be damped by fric¬ 
tion which is proportional to the velocity. Mathematically, these system properties are 
described by the following equation of motion: 

^2 q 

— T + /3 -b sinO = f cos(Qt). (27.78) 

dt z dt 


8 See, e.g., G.L. Baker and J.P. Gollub, Chaotic Dynamics , Cambridge University Press, Cambridge 
(1996). In this book, extensive use is made of the example of the driven pendulum. We also refer to 
H. Heng, R. Doerner, B. Huebinger and W. Martienssen, Int. Journ. of Bif. and Chaos 4, 751, 761, 
773 (1994). 
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Here, P is the friction parameter, and / and £2 denote the strength and frequency of 
the driving force, respectively. The eigenfrequency of the pendulum has been set to the 
value coq = 1, which may always be achieved by rescaling the time and the parameters 
p and /. As was described in Chap. 23, this explicitly time-dependent differential 
equation of second order may be rewritten in a system of three coupled autonomous 
(i.e., not time-dependent) differential equations of first order: 


— = —pco — sin 0 + / cos <p , 
dt 


dO 

dt 


= oo, 


d(p 

dt 


= n. 


(27.79) 


For P > 0, this is obviously a dissipative system, since the divergence of the velocity 
field F (compare (23.16)) is then negative: 


A = V-F = -p. 


(27.80) 


The equations of motion of the driven pendulum are too complicated to allow analytic 
solutions. Their numerical integration by the computer does not cause any trouble, 
however. 9 

Depending on the parameters £2, P, /, the driven pendulum displays many distinct 
types of motion. Here we shall investigate only a small section of the parameter space, 
namely the dependence on the driving strength / for fixed values of the frequency £2 
and of the friction constant p. As an example we choose a frequency £2 = 2/3 for all 
of the subsequent investigations , i.e., a value somewhat below the natural vibration 
frequency of the pendulum, and a friction parameter P = 0.5. 

The system of differential equations (27.79) is integrated numerically for various 
values of the parameter /, beginning with selected initial conditions 0(0) and cd( 0). 
To avoid needless effort, transient processes, i.e., the initial solutions of, e.g., the first 
20 vibrational periods, will be ignored. 

The effect of dissipation in the system ensures that the solution after some finite 
time approaches an attractor. The shape of this attractor may be analyzed in different 
manners. One may directly consider the time dependence of the displacement angle 
0(t) or plot the trajectory in the three-dimensional phase space 0,oo,<fi, where the 
third coordinate because of 0 = £2 1 just corresponds to the time. More transparent 
than this three-dimensional representation are reduced two-dimensional phase-space 
diagrams where the time is considered as a parameter and the trajectory is plotted in 
the 0, CD-plane. Contrary to the full three-dimensional phase space, the projected orbits 
may intersect here. Since 0 is a periodic angular variable, we restrict it to the interval 
—7T < 0 < 7T by the modulo operation. A trajectory that leaves the diagram at the right 
or left edge, corresponding to a loop of the pendulum, therefore enters again at the 
opposite edge. 

Figure 27.15 shows a gallery of selected phase-space diagrams arranged by increas¬ 
ing value of the driving force /. The value of / is always given at the top left in the 
partial figures. 


9 Readers are advised to explore the dynamics of the driven pendulum by their own computer ex¬ 
periments. For the integration of the differential equation a Runge-Kutta approach is recommended; 
see, e.g., W.H. Press et al., Numerical Recipes , Cambridge University Press (1989). 
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<9 


For weak perturbations, e.g., f — 0.9, the pendulum performs approximately har¬ 
monic librations about the zero position. The limit cycle Q(t) is a slightly distorted si¬ 
nusoidal vibration with the frequency £2, and correspondingly the path in phase space 
is approximately an ellipse. 

With increasing perturbation strength a bifurcation arises at about / = 1.07 with 
a period doubling , as is represented in Fig. 27.15 for / = 1.075: Two slightly differ¬ 
ent vibrational tracks are alternating. After further period doublings, one then finds a 
libration with twice the amplitude (represented for / = 1.12) in which the pendulum 
performs a loop but then moves back. The frequency of this oscillation is £2/3. 

In the range / ~ 1.15... 1.3, there arise chaotic solutions. The trajectory for 
f — 1.2 in Fig. 27.15 fluctuates in an erratic manner between librations and rotations 
in both directions, and correspondingly the path densely covers a domain in phase 
space. For comparison, Fig. 27.16 shows a regular (/ = 1.12) and a chaotic (/ = 1.2) 
trajectory, which correspond to the third and fourth partial figure in Fig. 27.15. 

For even stronger coupling /, the chaotic range is left again, and there occur rotat¬ 
ing periodic solutions, as is represented for / = 1.4. The angle increases linearly with 
time, according to 6(t) a zb Qt, superimposed by local fluctuations. For / = 1.45, a 
bifurcation with period doubling of the rotating solution occurs. On the average, the 
angle is unchanged, but now the local deviations alternate from period to period. At 
/ = 1.47, one obtains a second period doubling. 


Fig. 27.15. Typical phase- 
space trajectories of the 
periodically driven damped 
pendulum for various values 
of the parameter /. The 
parameters £2 = 2/3 and 
b — 0.5 are kept fixed 
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Fig. 27.16. Two trajectories 
0(t) for distinct values of the 
parameter /. For / = 1.12 
(left) a periodic motion arises, 
for / = 1.2 (right) a chaotic 
one. Note the distinct scales! 




t t 


After a bifurcation cascade, there again occurs a range with chaotic solutions , as 
is represented for the example / = 1.5. But soon the chaos is followed by regular 
motion, as is demonstrated by the beautiful phase-space trajectory for f — 1.51 in 
the last shown example. Here, one faces a periodic libration motion with two loops 
(angular range 3 • 27 t) and the period 5 (i.e., the frequency Q/5). 

A global survey of the behavior of the system is obtained from the attractor di¬ 
agram. One of the coordinates is scanned in regular time intervals, and the result is 
plotted along the ordinate as a function of a system parameter. Figure 27.17 shows the 
angular velocity co(t n ) scanned at the time points t n = to + n 2tt/ Q versus the strength 
/ of the driving force. For any value on the abscissa, 150 values of co are plotted. 
The upper margin marks the nine values of / for which the associated phase-space 
trajectories are shown in Fig. 27.15. 


Fig. 27.17. The attractor dia¬ 
gram of the driven pendulum 
exhibits the sequence of reg¬ 
ular and chaotic domains as 
a function of the parameter 
/. The phase-space trajecto¬ 
ries corresponding to the fre¬ 
quency values marked at the 
upper margin are shown in 
Fig. 27.15 



/ 


The attractor diagram clearly exhibits the previously discussed alternating ranges 
of regular and chaotic motion. The chaotic window at f — 1.15 ... 1.28 is followed by 
a broad domain of periodic (rotating) solutions at / = 1.28 ... 1.48, showing several 
pronounced period-doubling bifurcations (a “subharmonic cascade”). In the ranges 
/ = 1.11... 1.15 and / > 1.54, one finds solutions of period 3. The structural sim¬ 
ilarity of the attractor diagram for the driven pendulum with its counterpart for the 
logistic mapping, Fig. 27.6, is obvious. 
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One should note that the attractor diagram represented in Fig. 27.17 is not complete. 
This is due to the fact that our system of differential equations (27.79) is invariant 
under reflection because the pendulum has no preferred direction of oscillation. Each 
solution is accompanied by a reflected trajectory 

0 — 0 (mod 2 tt), co —co, <p —> 0 + it (mod 27 t), (27.81) 

which also satisfies the equation of motion. Angle and velocity are inverted, and the 
phase is shifted by a half-period. In general, the solutions always occur pairwise, 
which is obvious for rotational solutions because the pendulum may run “clockwise” 
or “anti-clockwise ” Which attractor is actually reached depends in a complicated 
manner on the initial conditions 0(0), &>(0). When plotting Fig. 27.17, only one path 
has been calculated, hence the “reflected” branches are missing. (The points are not re¬ 
ally reflected about 0 = 0 , since in the stroboscopic scanning the phase of the scanning 
moment is kept fixed and not shifted according to (27.81).) 

However, for periodic trajectories of period length n 2 tt / Q it may happen that 0 (t + 
mt/Q) = —0(0 (mod 27 t) holds. Such a symmetric trajectory is identical with its 
reflected partner, and there exists only one solution. This occurs, e.g., for the n = 3- 
vibration shown in Fig. 27.16 (left), and in particular also for the (n = l)-librations for 
small /. At about f = 1.01, the interesting case of a symmetry-breaking bifurcation 
occurs, which may be recognized by the kink in the attractor diagram Fig. 27.17. The 
attractor continues to be periodic with n = 1 , but looses its symmetry and therefore 
splits into a pair of distinct solutions which are reflected relative to each other. In the 
figure only the upper branch of this fork is included. 

The attractor diagram provides a good survey of the various kinds of motion 
of a dynamic system. A more far-reaching quantitative measure of the stability 
of trajectories are the Lyapunov exponents Oi discussed in Chap. 26. The driven 
pendulum—a three-dimensional system—has three Lyapunov exponents. One of 
these, let us call it ( 73 , has always the value <73 = 0. It belongs to the degree of free¬ 
dom 0, which according to (27.79) has the trivial linear time dependence cp(t) = Qt. 
Thus, any perturbations along this direction neither increase nor shrink exponen¬ 
tially. 

The maximum Lyapunov exponent o\ determines the stability of the system. At¬ 
tractors with o\ < 0 are periodic, those with o\ > 0 are chaotic. Figure 27.18 shows 
the result of a numerical calculation of the maximum Lyapunov exponent, plotted over 
the same parameter range as in the attractor diagram Fig. 27.17. One may clearly trace 
the sequence of regular and chaotic domains. At the bifurcation points the exponent 
o'\ touches the zero line from below. 

A conspicuous feature is that o\ never falls below the value —0.25. This is re¬ 
lated to the fact that in the dissipative system under consideration the sum of all three 
Lyapunov exponents is determined by the negative of the friction coefficient ft (here, 
P = 0.5): 

3 

= (27.82) 

i =1 

Because <73 = 0 in the case of maximum stability o\ = <72 = —/3/2. 
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Fig. 27.18. The largest Lya¬ 
punov exponent o\ of the 
driven pendulum as a function 
of the parameter /. Values 
g\ > 0 occur in the domains 
of chaotic motion which were 
perceptible already in the at¬ 
tractor diagram Fig. 27.17. 
Compare also the analogous 
representation of the Lya¬ 
punov exponent of the logistic 
mapping in Fig. 27.8 



The Poincare cut introduced in Chap. 24 may further characterize the attractors. 
When choosing 0 = 0o (mod 27 t) as the cut condition, one just has a stroboscopic 
mapping at equidistant time points t n = to + 2 tt/Q. The three-dimensional phase 
space reduces to two dimensions (0,co), and the continuous trajectory turns into a 
cloud of points. The Poincare cuts of periodic attractors simply consist of one or sev¬ 
eral fixed points, the number of which corresponds to the period length of the vibra¬ 
tion. The situation is different, however, for the nonperiodic strange attractors which 
are characteristic for the occurrence of chaotic motion. Here, the cloud of points of 
the Poincare cut covers extended partial domains of phase space more or less uni¬ 
formly. 

Figure 27.19(a) illustrates the situation for the chaotic attractor of the pendulum 
with a driving strength / = 1.2. The points were obtained by stroboscopic scanning 
of the trajectory shown in the fourth partial figure of Fig. 27.15 over 2000 vibrational 
periods. The detailed shape of the Poincare cut depends on the selected phase an¬ 
gle 0 O . 

The long curved object in Fig. 27.19 at first glance appears as a strange bent one¬ 
dimensional curve. A closer look, however, shows the chaotic attractor to be a much 
more complex geometric object. In the sector magnification of a small partial range 
of the attractor (see Fig. 27.19(b)), the seemingly single line dissolves into several 
closely spaced curves. But this is only the beginning, since a repetition of this opera¬ 
tion would show that in each sector magnification the new lines again decay into sev¬ 
eral fractions, and the procedure may be repeated infinitely many times. (This process 
is limited in practice only by problems in the numerical integration of the equation of 
motion. By the way, in order to plot Fig. 27.19(b), 100,000 periods had to be calcu¬ 
lated.) 

Thus, the attractor of the chaotic driven pendulum displays an infinite filigree 
(“puff-pastry”) structure. Mathematically it is a fractal with broken dimension (see 
Chap. 26) since the points of the Poincare cut occupy, roughly speaking, a larger vol¬ 
ume in phase space than a one-dimensional curve, but on the other hand they are too 
rare to cover a two-dimensional area. An analogous statement holds also for the full 
attractor in the three-dimensional phase space. 
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Fig. 27.19. The Poincare cut 
of a chaotic trajectory (/ = 
1.2) of the driven pendulum. 
The range in the upper fig¬ 
ure marked by a box is repre¬ 
sented below as a sector mag¬ 
nification, revealing the frac¬ 
tal structure of the strange at¬ 
tractor 


In Chap. 26, we dealt with the determination of the fractal dimension. A remarkable 
point is that the Lyapunov exponents 0 { may also be used to determine the dimension 
of a strange attractor. The existence of such a relation is not implausible, because the a; 
decide how “fast” a region of phase space spreads under the dynamic flow. Building 
on this consideration, Kaplan and Yorke 10 derived a formula for calculating a Lya¬ 
punov dimension D l. For our special case (one positive and one negative Lyapunov 
exponent), the Kaplan-Yorke relation reads 

Dl = 1 + — for <7i > 0, <72 < 0. (27.83) 

I <*21 

The relation between Dl and the other dimension measures has not yet been cleared 
up in full. The originally assumed identification of Lyapunov dimension and capacity 
dimension Df cannot be maintained, since counter-examples have been found. More 
recent speculations rather concern a possible relation with the information dimension 
D l = D h 

For the Poincare cut 27.19, one gets from (27.83) with o\ ~ 0.14, 02 — —0.64, 
the Lyapunov dimension Dl — 1.2. This value depends sensitively on the friction 


10 J.L. Kaplan and J.A. Yorke, Functional differential equations and approximation of fixed points, 
H.-O. Peitgen and H.O. Walter (eds.), Lecture Notes in Mathematics 730, Springer, Berlin (1979). 
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constant /3. For a weaker damping, the strange attractor “blows up,” and its dimension 
increases. 


EXAMPLE 

27.5 Chaos in Celestial Mechanics: The Staggering of Hyperion 

Hyperion is one of the more remote moons of Saturn. It revolves about Saturn with 
a revolution period of 21 days, on an ellipse with eccentricity £ = 0.1 and a large 
semiaxis a = 1.5 • 10 6 km. 

The motion of Hyperion is a particularly impressive example of chaotic stagger¬ 
ing within our solar system. In this section we shall describe this behavior using a 
simplified model. The satellite Voyager 2 among others has supplied pictures of the 
moon Hyperion. Hyperion is an asymmetric top that may be roughly described by a 
three-axial ellipsoid with the dimensions 

190 km x 145 km x 114 km (±15 km). (27.84) 

Hence, one obtains for the principal moments of inertia ©i < ©2 < © 3 : 



(27.85) 


The striking prediction is that Hyperion performs a chaotic staggering motion in the 
sense that its rotational velocity and the orientation of its rotational axis vary signif¬ 
icantly within a few revolution periods. This chaotic dancing, which must have hap¬ 
pened also for other planetary satellites during their history (e.g., Phobos and Deimos 
with the planet Mars have been calculated), is implied by the asymmetry of Hyperion 
and by the eccentricity of the orbit. 

To describe the change of the rotational velocity, we adopt the following model 
(see Fig. 27.20): Hyperion H orbits Saturn S on a fixed ellipse with semimajor a and 
eccentricity s. r represents the distance between Saturn and Hyperion, ip the polar 
angle of motion. Thus, the trajectory of Hyperion is given by 


Fig. 27.20. A simple two- 
dumbbell model for the asym¬ 
metric Saturn moon Hyperion 



1 









27.2 One-Dimensional Mappings 


545 


r((p) =- 

1 + £ cos cp 


(27.86) 
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Its asymmetric shape is simulated by four mass points 1 to 4 with equal mass m which 
are arranged in the orbital plane. Let the line 2-1 (distance d) be the (body-fixed) 
ei-axis, the line 4-3 (distance e < d) the (body-fixed) e 2 -axis. The e 3 -axis points 
perpendicular out of the image plane: e 3 = ei xe 2 . The angle d specifies the rotation 
of Hyperion about the e 3 -axis. It is defined as the angle between the semimajor a and 
the ei-axis. The moments of inertia obey 



(27.87) 


In this model, the satellite shall rotate only about the e 3 -axis, i.e., the axis perpendicu¬ 
lar to the orbital plane with the largest moment of inertia. This restriction is motivated 
because the tidal friction over very long times causes (1) the rotational axis of a moon 
to align along the direction of the largest moment of inertia, and (2) causes this direc¬ 
tion to adjust perpendicular to the orbital plane. Moreover, the orbital angular momen¬ 
tum of Hyperion is assumed to be constant. This is a very good approximation, since 
the intrinsic angular momentum L e of Hyperion is always very small relative to the 
orbital angular momentum L#, |L^ |/|L^ | ~ ( d 2 + e 2 )/a 2 ~ 10 -8 . The gravitational 
field at the position of Hyperion is not homogeneous, and since ©i and ©2 are distinct, 
the satellite experiences a torque that depends on its orbital point and its orientation 
d, which will be calculated now. The tidal friction shall be neglected, however. The 
torque acting on the pair of masses (1, 2) is 


D (1.2) = ^l x(Fl _p 2)j 


(27.88) 


where 

ymMri 


(27.89) 


is the force acting on the mass point i, M being the Saturn mass. Figure 27.21 once 
more illustrates the torque that is caused by the gravitational forces Fi and F 2 at the 
positions ri and r2, and by the centrifugal force F = — (Fi + F2) at the position r. 


Fig. 27.21. The forces caus¬ 
ing the torque D^ 1,2 ^ 


2 



1 


Since the length d ~ 200 km is small compared to the distance r ~ 10 6 km, the 
cosine law yields 



(27.90) 
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The positive sign holds for r\, the minus sign for 7 * 2 . a is the angle between ri — r 2 
and r. From (27.90), we obtain 


1 ~ 1 (^ 3d \ 
-r- ^ — I 1 =F - - cos O' I. 


r 3 r 3 V 2 r 


(27.91) 


Hence, for D (1,2 ^ we find 
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(27.92) 


In the last step of rewriting, yM has been expressed by the orbital period T and the 
semimajor a , using the third Kepler law: 


2ttY 


yM = I — I A’ 


(27.93) 


Thereby, the reduced mass was replaced by the mass of Saturn. 

The torque for the pair of masses (3, 4) is obtained in the analogous way as 


D (3,4) = £|2 x(F3 _ F4) 
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The total torque D = D^ 1,2 ^ + D^ 3,4 ^ is therefore 


(27.94) 


D = ^(y-) (© 2 -©i)(-| sin2ae 3 


(27.95) 
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Thus, the torque vanishes if ©i = © 2 . Besides, a configuration with a experiences 
the same torque as a configuration with 180° + a. The torque tries to rotate Hyper¬ 
ion in such a way that at any moment the ei-axis points toward Saturn. The expres¬ 
sion (27.95) for the torque remains correct even if a more realistic mass distribution is 
assumed. With 


JL 

D = — = ©3 
dt 


d 2 d 
dt 2 ’ 


(27.96) 


the equation of motion for the eigenrotation of the satellite reads 


© 3 # : 


~\(py) (02_0l) (^7)) sin ( 2 ^-*V»)- 


(27.97) 


Here, we set a = cp — d. Equation (27.97) involves only one degree of freedom, #, but 
the right side depends via the orbital radius r(t ) and the polar angle (pit) on the time 
and is therefore not integrable. An exception is the case of a circular orbit. Then the 
mean angular frequency 



(27.98) 


equals the angular velocity co , and r = a, cp(t) = nt. With d' = 2(d — nt), the differ¬ 
ential equation (27.97) therefore simplifies for s = 0 to 


© 3 tf' = —3 m 2 (© 2 - ©i) sin 


(27.99) 


This is the differential equation for the pendulum. The integral of motion is the energy 
E. To determine this quantity, we multiply (27.99) by 

+ 3n 2 (& 2 - ©Otf'sintf' = 0, (27.100) 
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which implies 

— ( - 3« 2 (©2 - ©i)C0S!?' I =0. 

dt \2 ) 

Hence, 


(27.101) 


E = 



3ft 2 (© 2 


®i)cosd' 


(27.102) 


is an integral of the motion. Just as for the pendulum (see, e.g., Example 18.8) we 
have that for energies E larger than Eq = 3n 2 (©2 — ©i) the satellite rotates, while 
for E < Eo it vibrates. Due to the (so far neglected) tidal friction the energy E will 
decrease more and more until it reaches the minimum value E^ m = —Eq. From this 
follows ^ in = 0 and ^ in = 0. Therefore, in the final state of satellites on a circular 
orbit the ei-axis always points toward the planet (bound rotation), as is known from 
the earth’s moon. 

As was stated already, the differential equation (27.97) for s ^ 0 cannot be solved 
analytically. One may try, however, to get approximate solutions for s 1. This we 



548 


27 Systems with Chaotic Dynamics 


Example 27.5 


shall do now. First, we introduce dimensionless quantities, the time t' = nt = 2nt/T 
and <j)q = 3 (©2 — ©i)/© 3 . Then (27.97) turns into 

d 2 d coia 3 . . v 

^ = <27 103) 

Since r(t') and <p(t') are periodic in 2tt, the right side can be expanded into a Fourier- 
like Poisson series. One obtains 

d 2 d coi ^ / m \ . 

— = (27.104) 

m =—oo ' ' 


In order to determine the coefficients H(m/ 2, e), r(t') and must be known. 
(p(t) is obtained, e.g., via the second Kepler law as the solution of the following dif¬ 
ferential equation: 


dip (1 + £COS<^) 2 

dt hk 2 


(27.105) 


Solving this differential equation and hence the determination of H(m/ 2, s) are be¬ 
yond of the scope of this example. We only quote that the coefficients H are pro¬ 
portional to £ 2 \ m / 2 ~ l \ and were tabulated by Cayley 11 in 1859. For small s, we have 
H(m/ 2, s) ~ —s/2, l,ls/2 for p = m/2 = 1/2, 1, 3/2. Here the half-integer variable 
p has been introduced. If the argument of one of the sine functions varies only weakly 
with time, i.e., if 



— m 


« 1 , 


(27.106) 


there occur resonances. It is then advantageous to rewrite the equation of mo¬ 
tion (27.104) in such a way that it depends on the only slowly varying variable 
Y P = $ ~ Pt': 

--//£ = H (P - £ ) sin2 y p - sin(2 y p - nt'). (27.107) 


It turns out that the terms in the sum are oscillating so rapidly compared to the variation 
of y p that their total contribution to the equation of motion largely averages out, if 
cdq and s are sufficiently small. In first approximation for small go q and e the high- 
frequency terms can be eliminated from the equation of motion by keeping y p fixed 
and averaging (27.107) over a period. One then obtains 


(pYp 

dt' 2 


= — e) sin2]/^. 


(27.108) 


This is again the pendulum equation. Integration of (27.108) analogous to (27.100)- 
(27.102) again yields the energy 


E p = 


i /dyp 

2\dt' 
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H(p, s)cos2y p . 


(27.109) 


11 The original paper is by A. Cayley, Tables of the developments of functions in the theory of elliptic 
motion, Mem. Roy. Astron. Soc. 29, 191 (1859). 
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Again, y p vibrates if E p is smaller than E s p = | H(p, s)\cDq/4, and rotates if E p is 
larger. For H(p, s) > 0, y p vibrates about 0; for H(p, s) < 0, y p vibrates about 7 t/2 . 
The essential difference compared with the pendulum equation with s = 0 is that here 
exists not only the synchronous (p = 1 ) solution but also resonances, depending on 
the initial condition. So, it may happen, e.g., that a satellite is captured by the tidal 
friction into a p — 3/2-state. This happens in the solar system for Mercury: During 
two circulations about the Sun it rotates exactly three times about its axis. 

The question of to what extent the averaging over the high-frequency contributions 
is justified and whether these are indeed only weak perturbations is complicated and 
shall not be traced further here. But it is vividly clear that the high-frequency terms 
must not be neglected if the energy is close to the limit E p = E s p . Then these terms 
will actually decide whether the satellite performs a full turn (in y p ) or whether it vi¬ 
brates back. There exists a band of energies w p • E s p that are very close to E s p , with 
w p defined by w p = ( E p — E s )/E s . For energies within this band the high-frequency 
perturbations cannot be neglected. We shall only state without derivation that Chirikov 
found an analytical criterion for the width w p of this band . 12 Chirikov’s criterion pre¬ 
dicts that for the parameters of Hyperion co q = 0.89 and £ = 0.1 the averaging over 
the high-frequency components is not possible, since the width of the band belonging 
to p = 1 and p = 3/2 is so large that the two bands overlap. In contrast, for co^ = 0.2 
and s = 0.1 the averaging over the high-frequency components should be a good ap¬ 
proximation. The following figures represent Poincare cuts for these two cases. They 
show points in phase space taken at (p = 0 . 

Thereby, the differential equations have been solved numerically . 13 If the motion is 
quasiperiodic, then successive points form a smooth curve; chaotic trajectories seem 
to cover areas in a random manner. 

Because of the symmetry of the inertial ellipsoid, the orientation ft is identical to 
that with ft + 7 r. Therefore, the graphs were plotted only for the range of ft between 0 
and n . By averaging over the high-frequency components, solutions were obtained in 
which y p = ft — pt' vibrates (for E p < E s p ). For each of these solutions, dft/dt’ has 
a mean value of exactly p , and ft takes all values between 0 and 2tt. If one considers, 
however, only the points with cp = 0 , i.e., times t f = 2jtn, then y p exactly corresponds 
to ft modulo 7 r. Therefore, a vibration in y p appears as a vibration in ft . The successive 
points of quasiperiodic vibrations therefore yield a simple curve in the vicinity of 
dft/dt' = p that contains only a part of the angles between 0 and 7 T. For nonresonant 
quasiperiodic trajectories (E p > E s p ), all y p s rotate, and successive points form a 
single curve that contains all angles ft . 

As is seen from Fig. 27.22, for small values of coo and £ the resonant states and 
the nonresonant ones are separated by a narrow chaotic zone. The figure shows ten 
distinct trajectories. Three of them correspond to the quasiperiodic vibrations in the 
states p — 1/2, 1, and 3/2. As predicted by the approximation, y \/2 vibrates about 
ft = 7 r/ 2 ; y\ and 3 / 3/2 vibrate about ft = 0. Three further trajectories, always enclosing 
the resonant states, are chaotic. They fill narrow bands with points in a seemingly 
random manner. The last four trajectories show that each chaotic band is separated 
from the other bands by nonresonant quasiperiodic trajectories. 


12 B.V. Chirikov, A universal instability of many-dimensional oscillators systems, Phys. Rep. 52,262 
(1979). 

13 J. Wisdom, S.J. Peale and F. Mignard, The chaotic rotation of Hyperion, Internat. Journal of Solar 
System Studies 58, 137 (1984). 
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Fig. 27.22. Transverse cut of 
the phase space for = 0.2 
and £ = 0.1 



Fig. 27.23. Transverse cut of 
the phase space for the pa¬ 
rameters of Hyperion: co q = 
0.89 and s = 0.1 



Figure 27.23 displays the situation for Hyperion. At least the chaotic zones of the 
states p = 1 and p = 3/2 are no longer separated from each other. There is a small 
remainder of the quasiperiodic p = 1/2 state; the quasiperiodic p = 3/2 state disap¬ 
peared completely. Instead, there is a quasiperiodic state at p = 9/4, & = 7r/2, which 
is not given by the approximation (27.108). In total one sees 17 trajectories in the fig¬ 
ure: eight quasiperiodic vibrations of the states p = 1/2, 1,2, 9/4, 5/2, 3, and 7/2, 
five nonresonant quasiperiodic rotations, and four chaotic trajectories. 
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A deeper study shows that the alignment of the rotational axis perpendicular to the 
orbital plane is not stable, both in the chaotic as well as in the synchronous state. This 
means that a small deviation of the rotational axis from the vertical increases exponen¬ 
tially. The time scale for the resulting staggering motion is of the order of magnitude 
of several orbital periods. The final stage of a “normal” moon is for Hyperion com¬ 
pletely unstable. But if it tilts out of the perpendicular to the orbital plane, (27.97) is 
no longer sufficient, and one has to solve the full nonlinear Euler equations. One then 
finds that the full three-dimensional course of motion is completely chaotic. All three 
characteristic Lyapunov exponents are positive (of the order of magnitude 0.1), which 
implies a strongly chaotic staggering. Even if one could have measured the spatial 
orientation of the rotation axis at the moment of Voyager 1 passing Hyperion (in No¬ 
vember 1980) with a precision of up to ten figures, it nevertheless would not have been 
possible to predict the orientation of Hyperion’s axis at the moment when Voyager 2 
passed it (in August 1981). 

Up to this point, the tidal friction, which causes a relatively very slow change of 
the initially Hamiltonian system, has been neglected. But one can roughly describe 
the history of Hyperion. Presumably, the period of the eigenrotation was initially much 
shorter than the orbital period, and Hyperion began its evolution in the range far above 
that which is shown in the figure. Over a time of the order of magnitude of the age of 
the solar system (circa 10 10 years) the eigenrotation was decelerated, and the rotation 
axis straightened up perpendicular to the orbital plane. Thereby, the premises of our 
simplified model (27.97) are approximately justified. But when the evolution once had 
reached the chaotic domain, “the work of the tides lasting aeons was destroyed in a 
few days,” 14 since once it arrived in the chaotic domain, Hyperion began to stagger in 
a fully erratic manner (which continues to the present day). 15 Sometimes, its path will 
end up in one of the few stable islands of the figure. But this cannot be the synchronous 
state, since the latter state is unstable. 


14 J. Wisdom, Chaotic behavior in the solar system, Nucl. Phys. B (Proc. Suppl.) 2, 391 (1987). 

15 The observations of Voyager 2 are consistent with this statement. The staggering of Hyperion has 
also been observed directly from the earth (see J. Klavetter, Science 246, 998 (1988), Astron. J. 98, 
1855 (1989)). 
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On the History of Mechanics 


Here, we follow Friedrich Hund, Einfiihrung in die Theoretische PhysikBd. 1, Mechanik, 
Bibliographisches Institut Leipzig (1951), and also I. Szabo, Einfiihrung in die Technische 
Mechanik , Springer-Verlag, Berlin, Gottingen, Heidelberg (1956). For more on what is presented 
here, in particular on the history of statics, we refer to the works of P. Duhem: 

Les origines de la statique, Paris (1905-1906), 

Etudes sur Lionard de Vinci , Paris (1906), 

Le systeme du monde, Paris (1913-1917), 
and to P Stemagel: 

Die artes mechanical im Mittelalter (1966), 
and to F. Krafft: 

Dynamische und statische Beobachtungsweise in der antiken Mechanik (1970). 
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Physics describes nature by theoretical-physical, mathematical theories. It uses gen¬ 
eral but precise concepts, builds upon certain laws (“axioms”) and makes precise 
mathematical-logical statements on natural phenomena. An important role is played 
by the idealization of reality, i.e., the representation of reality by describing ideal sit¬ 
uations. Precise statements can be made only about these ideal cases. In doing so, 
physics utilizes systematically worked-out procedures for proof (reproducibility) of 
its statements. The application of the laws of physics has led to a far-reaching control 
of nature which seems to grow continually. 

Physics originated in the occidental culture. As a modem natural science it has 
characterized this society for about 300 years, i.e., since about the seventeenth century. 
One should note that the consideration of nature in Greek antiquity differs essentially 
from the present natural science in our society. In ancient Greece, say at the time of 
Plato 1 and Aristotle} there existed little systematic knowledge of nature. What ex¬ 
isted appears from a present-day view to be primitive and of low precision. Attempts 
at ultimate sharpening physical concepts were rarely known, although the clear line 
of thought in the then already highly developed mathematics might have served as a 
model. For example, Eudoxus 3 (400-347 BC) was one of the most outstanding math¬ 
ematicians of that age who was close to Plato. The philosopher Plato in turn admired 
the inner logical consistency of mathematics. Partial disciplines of physics, such as 
astronomy, statics, and music theory, only later were quantified. But no genuine me¬ 
chanics emerged. The analysis of motion and the precise formulation of the concept 
of force were not achieved in antiquity, although mathematics (e.g., by Archimedes, 4 
287-212 BC) was actually just as far developed as in the seventeenth century when 
the theory of motion originated. It seems to be a peculiar feature of the ancient cul¬ 
ture that mechanics in its full meaning, and hence the beginning of comprehensive 
physics, could not develop in the ancient world. It is also remarkable that the natural 
scientists of the antiquity hardly influenced the general consciousness of that age. The 
specialized sciences were not of public concern. Scientists in Alexandria, Pergamon, 
or Rhodos belonged to a closed circle. 

The cultures in East Asia and India, still existing nowadays, produced out¬ 
standing achievements in other disciplines and certain techniques (ceramics, dye- 
works), but no physics. The conceptional-mathematical thinking about nature and the 
questioning of nature by determined experiments were not developed or promoted 
there. 

Physical science emerged on one hand from the philosophic thinking about nature 
and on the other hand from technical problems (e.g., military resources, traffic, civil 
engineering, mining). In particular the first developed mechanics, which at the very 
beginning was merely statics, goes back to these two sources. A proper unification of 
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these two approaches to nature was, however, not reached in antiquity. Archimedes 
(287-212 BC) represented the climax in ancient statics: the lever principle, the con¬ 
cept of the center of gravity of a body, and the well-known hydrostatic law named 
after him, were known to him in full clarity. Yet, Archimedes’s discoveries fell into 
oblivion. The reasons are not known. Possibly his ideas were simply too hard to be un¬ 
derstood. Whatever the reasons were, in the Middle Ages only the works of Aristotle 
were known, and these determined the further development of mechanics. 

In the fourteenth century, statics enjoyed a time of prosperity, mainly due to distin¬ 
guished men of the artist faculty of the university in Paris. The methods for decom¬ 
posing and combining forces were developed there and utilized for solving statical 
problems. In addition, the concept “work of a force” was introduced, and the “vir¬ 
tual work” in virtual displacements was correctly used in simple cases. Leonardo da 
Vinci 5 (1452-1519) was a leading researcher in mechanics of his time. He performed 
the decomposition of forces for investigation of moments (lever law). For individ¬ 
ual examples he traced the law of the parallelogram of forces back to the lever law. 
These relations were clearly and precisely formulated by Varignon (1654-1722) and 
Newton 6 (1643-1727) not before the seventeenth century. 

The dynamics of mass points was created in the seventeenth century. Since this 
is a very important period in the history of mechanics, we will outline it in more 
detail. It should be mentioned that the formal completion and mathematical treatment 
of mechanics happened in the eighteenth century and culminated in the mechanics of 
Lagrange 1 (1736-1813). 

The seventeenth century was presumably the most decisive period in the history of 
physics, probably the birth of physics as an exact science per se. In that time mechanics 
was created and completed in outline, monumental in its scientific clarity and beauty, 
and convincing in its predictive power and mathematical formulation. The processes 
of motion in the heavens and on earth were described in a consistent way. Method¬ 
ically in mechanics one succeeded for the first time in sharply conceptualizing ex¬ 
periences (experiments). This was achieved mainly by using mathematical language, 
combined with the invention of very fruitful abstractions and idealized cases about 
which one was now able to make precise and final statements. The implications of the 
new physical knowledge were quickly realized in public consciousness, as is demon¬ 
strated by the science methodology of Bacon 8 (1561-1626), Jungius 9 (1587-1657) 
and Descartes 1 ® (1596-1650). 

The new scientific spirit did not concern mechanics only. Actually the first book on 
physics in the meaning of the new science stems from the English physician Gilbert 11 
(1540-1603) on the magnet. He started from well-planned experiments, generalized 
them, and in this way came to general statements on magnetism and geomagnetism. 
This book strongly influenced the intellectual-scientific evolution of that age. The 
great scientists of that era knew it ( Kepler 12 praised it, Galileo 13 used it). However, 
it was not in magnetism that physics made a great breakthrough, but dynamics. The 
following stages were important: 

(1) Kepler interpreted the processes in the sky as physical phenomena; 

(2) Galileo succeeded in the correct conceptual understanding of simple motions. He 
invented the abstraction of the “ideal case”; 

(3) Huygens 14 and Newton cleared up and completed the new concepts. 

In the following table, we list the most important researchers and thinkers of that 


era: 
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1473-1543 

1530-1590 

1540-1603 

1548-1603 

1561- 1626 

1562- 1642 
1571-1630 
1587-1657 
1592-1655 
1596-1650 
1608-1680 
1629-1695 
1635-1703 
1643-1727 


Copernicus 

Benedetti 

Gilbert (1600 De Magnete) 

Stevin 

Bacon of Verulam 

Galileo (1638 Discourses on Fall Laws) 
Kepler (1609 Astronomia Nova) 

Jungius 

Gassendi 

Descartes (1644 Principia Philosophiae) 
Borelli 

Huygens (1673 Pendulum Clock) 

Hooke 

Newton (1686 Principia). 


According to the opinion prevailing in antiquity—presumably going back to 
Aristotle—heaven and earth were greatly separated from each other. The heavens rep¬ 
resented the perfect, unchanging, divine. The earthly, on the contrary, was changing 
and chaotic. This opinion was considered to be confirmed by the circular orbits of the 
celestial bodies and the ideally straight earthly motions. The stars were interpreted as 
being essentially different from the earth, which presumably prevented the progress 
of the heliocentric world system which had been invented already in antiquity. Only 
Copernicus 15 (1473-1543) made this breakthrough. As a proof for his doctrine, which 
states that the sun is in the center of the world, he had to offer only the simplicity and 
beauty. This was not yet a “physics of the heaven,” but rather a kind of geometrical 
ordering of the world. 

Only Kepler presumably realized the connection of the motions in the sky with 
physics. He asked about the forces when he put the orbiting velocity of planets that 
decreases outward in a causal connection with a force originating from the sun and 
decreasing with the distance from it (1596). He justified the validity of the heliocentric 
system with the sun as the origin of the force which causes the planetary motions. In 
his Astronomia Nova Seu Physica Coelestis he expressed (1609) the first planetary 
laws. But his conclusion, a 1/r-dependence of the gravitational force is false. He 
concluded from the area law 


r 2 ep = constant 


that the velocity rep perpendicular to the vector sun-planet is inversely proportional to 
the radius 

constant 

rep =- 

r 

and, therefore, the force should also show such a dependence, which, as stated already, 
is wrong. He related this reasoning with the propagation of the force in the planetary 
plane. Thus, Kepler understood the problem set by the planetary motion rather clearly, 
but he still did not yet have the conceptual and mathematical tools for describing the 
curved motion. 

Kepler on the one hand was a rational astronomer and physicist who searched for 
the laws of motion of planets and for the reasons behind them; on the other hand he 
was an aesthete who considered the world as an ordered entity. In the Harmonices 
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Mundi (1619), he ascribed a particular role in the structure of the world to the five 
regular bodies. He used them as mathematical “archetypes.” The use of mathematics 
for describing connections, as soon became customary, was denied him, however. 

Galileo does not match to Kepler in the physical interpretation of planetary motion. 
He considered the circular orbit to be natural and did not yet understand the general 
meaning of the laws of inertia. Borelli 16 (1608-1680), on the contrary, qualitatively 
described the motions of celestial bodies as an interplay of the attraction by the sun 
and of the centrifugal force. He understood the planetary motion as a problem of the¬ 
oretical mechanics. But to solve this problem essential tools still had to be developed; 
first of all free fall and the throw had to be understood. 

The idea of the inertia of bodies, their persistence in the state of uniform motion in 
the absence of forces, was hard to understand. The abstraction was accepted only la¬ 
boriously. On the contrary, it was easier, more natural, to ask for the reason of changes 
of position and to look for relations between force and velocity, as was already done 
by Aristotle in nebulous form. He took the view that the thrown body had a certain 
immaterial ability which, however, gradually decreased (vis impressa natura—liter 
deficiens ). In the fourteenth century this ability was called “impetus” (Buridan, 1295- 
1336?). The impetus normally remains constant, but gravity and air resistance can 
modify it. A falling body is accelerated because more and more new impetus is added 
by gravity (Benedetti} 1 1530-1590). Galileo also shared this view, but he at first tried 
to combine this understanding with the doctrine of the Aristotelian school that force 
and velocity were proportional to each other, in a complicated nontransparent way. 

Only on the third day of the Discourses on Fall Laws , clarity about the course of 
the fall motions is achieved. The ideal cases of uniform motion and of uniformly ac¬ 
celerated motion are outlined and mathematically grasped. The uniformly accelerated 
motion is compared with the experience of free fall and the fall on the tilted plane. 
Finally, on the fourth day of the discourses the tilted throw is analyzed: it is correctly 
understood as a composition of an (ideal) propagating motion and an (ideal) fall mo¬ 
tion. 

Although Galileo described the law of inertia (persistence of bodies in uniform 
motion if no forces are acting on them) and the proportionality between force and 
acceleration, he did not express the general law of motion. He also did not apply his 
knowledge of the laws of falling bodies on planetary motion. From his entire work 
one realizes how reluctantly he gradually gave up the old ideas. 

But the new ideas gained acceptance. In 1644, Descartes made the first attempt to 
formulate general laws of motion. He talked about the persistence of bodies in the state 
of rest or of motion, about linear motion as most natural motion (meaning force-free 
motion) and about the “conservation of motion” in the impact of bodies. The latter 
is obviously the conservation of momentum (called motion) JA mi v z -. Descartes how¬ 
ever did not realize the vector character of the momentum (the “motion”). Presumably, 
therefore, his applications of this law are wrong. 

The conceptual completion of mechanics was achieved by Huygens when treating 
curved motions. He studied the motion of a body on a given path in the gravitational 
field of the earth (tautochrone problem) and demonstrated clear understanding of the 
centripetal and centrifugal force in the treatment of the pendulum clock (1673). He 
explained these topics by infinitesimal considerations as uniformly accelerated devia¬ 
tion from linear motion. He realized the proportionality between centrifugal force and 
(centrifugal) acceleration, where the acceleration is already defined infinitesimally. 
Huygens realized the momentum was a vector quantity, and thus he correctly inter- 
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preted the momentum conservation law, as can be seen from his applications of this 
law to impacts. 

Then Newton came up with his brilliant work Philosophiae Naturalis Principia 
Mathematica (1686/87), and systematically showed the connection between mass, ve¬ 
locity, momentum, and force. He demonstrated by the example of the gravitation law 
how a force (measured by the change of the momentum) is determined by the loca¬ 
tions of the involved bodies. Finally he applied the laws of mechanics to the treatment 
of planetary motion, and he showed how the gravitation law follows by induction from 
Kepler’s laws, and how the Kepler laws follow deductively from the gravitation law. 
This represented the final breakthrough and the proof of the new mechanics; so to 
speak the completion of Kepler’s quest. 

After the preceding considerations, the question might arise why Kepler failed to 
discover the acceleration law—and, hence, gravity—although it follows seemingly 
simply from his own law. But it is not for us to accuse Kepler for this reason of 
a lack of brilliance and imagination. It is beyond any doubt that he had both—the 
genius in empirical research and the imagination in far-reaching speculations.* The 
explanation is as follows: Kepler was a contemporary of Galileo, who survived him by 
twelve years. Although Kepler knew the Galilean mechanics, in particular the central 
concept of acceleration, the laws of inertia and of throwing by correspondence and 
by hearsay, he nevertheless could not work it out to a consistent structure. (Note that 
Kepler died in 1630, while Galileo’s Discorsi outlining his mechanics was published 
only in 1638.) But even more decisive is the fact that the theory of curved motion— 
invented by Huygens for the circle and completed by Newton for general orbits—was 
not available to Kepler. But without the concept of acceleration for curved motions 
it is impossible to derive the form of the radial acceleration from Kepler’s laws by 
simple mathematical manipulations. 

Newton’s mechanics of gravitation, which emerges from the dynamical fundamen¬ 
tal law and from the reaction principle, is by its nature a further development of the 
throw motion discovered by Galileo. Newton writes about this point: “That the plan¬ 
ets are being kept in their orbits by central forces is seen from the motion of thrown 
objects. A (horizontally) thrown stone under the action of gravity will be deflected 
from the straight path and falls, following a bent line, finally down to the earth. If it is 
thrown with greater velocity, it flies farther away, and so it might happen that it finally 
would fly beyond the borders of the earth and would not fall back. Hence, the mis¬ 
siles thrown away from the top of a mountain with increasing velocity would describe 
more and more extended parabolic curves and finally—at a certain velocity^—return 


* For example, in his thoughts concerning the possible number of planets, since he was convinced— 
like the Pythagoreans—that God had taken the choice on number and proportions according to a 
certain number rule. 

f The value for horizontal throw is correctly given by Newton from mv 2 /R = mg as v = V §P = 
7900 ms -1 ; for the vertical shot into space the necessary velocity is obtained from the energy law 


oo 



R 

with g = yM/R 2 as 

v = y/2gR = 11200 ms -1 . 


Both results do not involve the friction losses in the air. 
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to the top of the mountain, and in this way move around the earth.” An overwhelming 
argument—by conception and compelling logic! 

The English physicist Hooke 18 (1635-1703), who is known as the founder of the¬ 
ory of elasticity, also came close to the gravitation law. This is shown by the following 
statements made by him:^ “I shall develop a world system which in every way agrees 
with the known rules of mechanics. This system rests on three assumptions: (1) All 
celestial bodies have an attraction directed toward their center (gravity); (2) all bod¬ 
ies that are put in straight and uniform motion move in a straight line until they are 
deflected by any force and forced into a curved path; and (3) attractive forces are the 
stronger the closer they are to the body they are acting upon. Which are the various 
degrees of attractions I could not yet determine by experiments. But it is an idea that 
must enable the astronomers to determine all motions of celestial bodies according to 
one law.” 

These remarks show that Newton did not create the monument of his Principia 
out of nothing. But it took an immense mental strength and bold ideas to concentrate 
all that created by Galilei, Kepler, Huygens, and Hooke in physics, astronomy, and 
mathematics into one focus, and in particular to announce that the force that lets the 
planets circulate on their orbits around the sun is identical with the force that drives 
the bodies on the earth to the floor. 

For this knowledge, mankind needed one-and-a-half millennia if one considers that 
in the Moralia (De facie quae in orbe lunae apparet ) Plutarch 19 (46-120) states that 
the moon by the momentum of its orbiting is just in the same way kept from falling to 
earth as a body which is “rotated around” in a sling. It took the genius of Newton to 
realize what the “sling” for the planets is! 

The new mechanics had a tremendous impact on the spirit of that age. Now there 
existed a second incontestable science besides mathematics. Furthermore, the exact 
natural sciences were born: Mechanics advanced to their model. 

Fet us summarize once again the most important stages in the evolution of mechan¬ 
ics from a present-day view: The essential part of mechanics and of its fundamental 
concepts is expressed in the basic dynamic law 


dp 

dt 



= F. 


Here, basically the acceleration appears as the signature of an acting force F; the law 
of inertia, i.e., the conservation of momentum 


p = m\ 

and, hence, of the velocity v = r if no external forces are acting, is also contained 
therein. This law of inertia had already been realized in the ancient and scholastic 
mechanics (Philoponos, Buridan) by experience. Uniform motion as the ideal case 


* There are still two other hints on the many-sided active genius of Hooke: In 1665, he writes the 
prophetic words: “I have often thought that it should be possible to find an artificial, glue-like mass 
which is equal or superior to that excretion from which the silkworms produce their cocoon and which 
can be spun to threads by jets.” This is the basic idea of the man-made fiber that—although two-and- 
a-half centuries later—revolutionized the textile industry! In the same year he writes, anticipating the 
mechanical theory of heat (hence, also kinetic gas theory): “That the particles of all bodies, as hard 
as they may be, nevertheless vibrate, one needs in my opinion no other proof than that, that all bodies 
involve a certain degree of heat and that never before has an absolutely cold body been found.” 
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of motion was described by Galileo. Descartes clearly formulated the law of inertia, 
and Huygens utilized it correctly. As already stated, in the basic dynamic equation 
the acceleration appears as a differential quotient of the velocity. Huygens clearly 
realized that. He also correctly understood the acceleration as a measure of the force, 
as well as the role of the mass in the momentum. Newton summarized everything in 
a sovereign manner and applied the fundamental law to celestial mechanics. In this 
sense Newton is the endpoint of the way to mechanics to which besides him also 
Galileo and Huygens contributed essentially. The general concept, however, is due to 
Kepler. 

For the history of mechanics, we also refer to the outlines of the history of physics. 
For the sections treated here, see particularly 

E.J. Dijksterhuis, Val en worp , Groningen 1924. 

E. Wohlwill, Galilei , Hamburg and Leipzig 1909 and 1926. 

The most important original papers were translated to German in the collection: 
Ostwald’s classics of exact sciences. The main works of Kepler and Newton are avail¬ 
able in German as 

J. Kepler, Neue Astronomie oder Physik des Himmels (1609). German translation. 
Munich 1929. 

I. Newton, Mathematische Principien der Naturlehre (1686/87). German Transla¬ 
tion. Leipzig 1872. 

E. Mach, Die Mechanik in ihrer Entwicklung historisch-kritisch dargestellt (1933). 

R. Dugas, A History of Mechanics, Neuenburg/Switzerland, (1955). 

R Sternagel, Die Artes mechanical im Mittelalter (1966). 

F. Krafft, Dynamische und statische Betrachtungweise in der antiken Mechanik 
(1970). 

For modern English editions and translations of historical texts, we refer to 

J. Kepler, W.H. Donahue (Translator), New Astronomy (1992), Cambridge Univ. 
Press. 

I. Newton, I.B. Cohen, A. Whitman (Translators), The Principia: Mathematical 
Principles of Natural Philosophy (1999), Univ. California Press. 

J. L. Lagrange, A.C. Boissonnade, V.N. Vagliente (Translators), Analytical Mechan¬ 
ics (2001), Kluwer Academic Publishers. 

Finally, we mention some texts about the history of mechanics 

E.J. Dijksterhuis, The Mechanization of the World Picture: Pythagoras to Newton 
(1986), Princeton Univ. Press. 

E. Mach, T.J. McCormack (Translator), The Science of Mechanics: A Critical and 
Historical Account of its Development , 6th edition (1988), Open Court Publishing 
Company. 

R. Dugas, A History of Mechanics (1988), Dover Pub. 


Notes 

1 Plato, Greek philosopher, b. 427 BC, Athens-d. 347, Athens, was the son of Ariston 
and Periktione, from one of the most noble families of Athens. According to legend, 
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he wrote tragedies in his youth. The meeting with Socrates, whose scholar he was for 
8 years, became decisive for his turn to philosophy. After Socrates’ death (399), he 
first went with other scholars of Socrates to the town of Megara to study with Euclid. 
He then broadened his horizons by extended travel (first to Cyrene and Egypt). He 
soon returned home and opened the war against the educational ideal of the Sophists 
by his first works. He quickly won enthusiastic followers while dealing with science, 
secluded from public life. Presumably, scientific intentions led him about 390 to Italy, 
where he became familiar with the Pythagorean doctrine and school organization. He 
was introduced to the court of the tyrant Dionysius of Syracuse. Dionysius was at first 
much interested in him, but according to legend handed Plato over as a prisoner to 
the envoy of Sparta, who sold him as a slave. After payment for release and return 
to Athens, Plato founded the Academy in 387. Despite his bad experience, he set his 
hopes for a full effectiveness to Syracuse. In 368, he followed an invitation by Dion, 
uncle of the younger Dionysius, who hoped to win the young ruler over to Plato’s 
political principles. Dionysius, however, only tolerated Plato’s ideas for a short time. 
A third trip (361-360) also failed, since Dionysius distrusted him and turned against 
him. Plato spent the last years in Athens in continuing scientific activity within a circle 
of well-known scholars; according to legend, he died during a wedding meal. 

Plato’s works are all preserved, except for the lecture On the good , which can be 
reconstructed only in broad terms. But not all work recorded under his name is au¬ 
thentic. The authenticity of the 7th Letter and of Laws is controversial. 

The most important and surely authentic papers from the early period are: Apol¬ 
ogy, Protagoras, State I, Gorgias, Menon, Kratylos\ from the middle artistic period: 
Phaidon, Banquet, State II-X ; from the last years: Phaidros, Parmenides, Theaithetos, 
Sophistes, Timaios, Philebos. 

Almost all of his writings are dialogues that by language and structure are of great 
artistic beauty. In most of them, Socrates appears as the main host of conversation. 

Plato’s philosophy turns dialectics, which for his teacher Socrates had only the 
negative function of destroying the false knowledge on the good and the virtue, into 
an approach of realizing the good and the virtue-into a path to the “ideas.” The ideas 
are not acts of imagination but are the content of that being represented by them, 
which in itself is independent of us. By this distinction of the sensual (which is with 
us) from the hypersensual (the later “transcendent”), Plato became a promoter of the 
later so-called metaphysics. 

Since the innermost nature of love is the will for perpetuation, it comes to fulfill¬ 
ment only as love of the eternal ideas. All other love is a preliminary stage to that. To 
kill off the transitory sensual and to turn toward the everlasting ideas is the aspiration 
of the really philosophical man. The way toward this goal is the dialectics. Also, the 
nature of this method of perception of ideas is logic. 

Plato’s understanding of the role of the idea varies between the general idea and the 
idea a priori. Provided it is the latter one, it is not brought into man from the outside, 
but he remembers it as something he already knows but has forgotten. The under¬ 
standing of the idea is remembrance (anamnesis). The method of remembering is that 
of the hypothesis. By this, Plato means the proof in the form of the statement-logical 
conclusion: If the first, then the second. But now the first, therefore the second. Or: 
But now not the second, therefore not the first: e.g., in Menon : If virtue is knowledge, 
then it is teachable. But now it is knowledge, therefore it is teachable. But now it is not 
teachable, since there are practically no teachers of virtue. Therefore, it is not knowl¬ 
edge, but only correct opinion inspired by the Gods; to transform it into knowledge 
that is able to self-satisfy is according to Plato the essential duty of philosophy. 
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The later form of dialectics, the method of division (< diaeresis ) of the species into 
sorts, is the draft of a logical method of proof. Aristotle rightly interprets it as a pre¬ 
lude of the class-logical conclusion discovered by him. All proving is proving on 
conditions. These can be proved themselves. In the State , Plato sketches the idea of a 
completion of this proving up to the omission of all assumptions ( anyipodeton ), i.e., 
the idea of a proof by and from the purely logical. The absolute is defined here as 
the good. In his later works, Plato interprets the ideas as numbers, i.e., as units that 
include a manifold in themselves, and sees their absolute principle in the One and the 
“Great-and-small” (interpretation controversial). 

Plato remains aware of the limits of all human proofs. Where the dialectics ends, 
there remains the speculative speech that uses the language of myth. All knowledge of 
the sensual, the nature, does not go beyond well-founded presumption. Therefore, all 
natural-scientific speech is necessarily myth. Plato develops this idea in his dialogue 
Timaios , which was of particular influence in the Middle Ages. 

The question of what is for man the good and the virtue as the way toward this 
goal is answered by Plato by means of his dialectics of ideas, at first in the State by 
his doctrine of the four cardinal virtues: wisdom, bravery, prudence, and justice. The 
sketch of an ideal state serves only for proving this doctrine and does not represent 
a plan that should be realized. This “ideal state” with its subdivision into the three 
orders of scholastic profession, military profession, and peasantry, and his doctrine 
on the community of possessions and women, and on the necessity that the kings 
should become philosophers and the philosophers should become kings, later on was 
interpreted as a political program and became efficient. In the late work Philebos , 
Plato sees the good of human life in the composition of knowledge ( episteme ) and joy 
(hedone), where all knowledge is admitted, but among the joys only those that are not 
mixed with grief, those that cannot impair the knowledge. Men must be educated in 
the spirit of such a life ideal if a real and stable state shall be possible. This restoration 
program demands a radical restriction of the influence of the traditional literature on 
the individual and on the community. Plato’s philosophy and critics of art were also 
of extraordinary historical influence. 

Plato’s doctrine, Platonism, was first developed further in Plato’s school, the Acad¬ 
emy. One distinguishes the older, intermediate, and younger Academy. In the older 
one, whose first and most important leaders were Speusippos and Xenocrates, the 
Pythagorean attitudes of the late philosophy of Plato were emphasized. The ratio 
of ideas and numbers became the focus of interest; soon mythological elements 
joined. On the contrary, the leading men of the intermediate Academy, Arcesilaos 
(315-241 BC) and Cameades (214-129), intended to revive the critical-scientific at¬ 
titude of Plato. In this way, there emerged an—although moderate—skepticism that 
believes that only probable insight is possible. The younger Academy considers the 
power of reason again as more positive and combines thoughts of various systems in 
an eclectic manner, in particular Platonic and Stoic thoughts. To the younger Acad¬ 
emy belong Philo of Larissa (160-79) and Antiochos of Ascalon (f 68 BC), heard 
by Cicero in Athens. The Platonism of the three academies is called older Platonism. 
The transition from this one to the new Platonism is mediated by the “intermediate” 
Platonism, with the main representative Plutarch (AD 50-125), who taught a religious 
Platonism with a strong emphasis on the absolute transcendence of God and assumed 
a series of steps of intermediate beings between God and the world. 

In the Middle Ages until the twelfth century, only Timaios was known and had 
a strong impact. In the twelfth century, Henricus Aristippus translated Menon and 
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Phaidon , and in the thirteenth century, W. von Moerbeke translated Parmenides. The 
new Platonism had more influence than Plato’s original ideas. The historical evolution 
of the philosophy of the Middle Ages was largely determined by the confrontation 
between Platonism and Aristotelian philosophy. In the early scholastic, Platonism was 
dominated mainly by Augustinus; in particular, the school of Chartres had a Platonic 
orientation. In the high scholastic, Platonism formed a strong undercurrent in the doc¬ 
trines of the Aristotelians (Albert, Thomas). It emerged as an independent movement 
among the mathematical-scientific thinkers (Robert Grosseteste, Roger Bacon, Witelo, 
Dietrich of Freiberg) and among the German mystics. The latter established the link 
to the Platonism of the early Renaissance (Nicholas of Cues). 

Modern Platonism began during the Italian Renaissance. In 1428, Aurispa brought 
the complete Greek text of Plato’s works from Constantinople to Venice. Latin trans¬ 
lations soon emerged, the most important one by Marsiglio Ficino, who completed it 
1453 and 1483. Followers of Platonism included Lionardo Bruni and the older Pico 
Della Mirandola, as well as Byzantines who had fled to Italy, among them the two 
Chrysoloras, Gemistos Plethon and Bessarion. Central to this movement was the Pla¬ 
tonic academy in Florence, founded in 1459 by Cosimo de Medici and guided by 
Ficino. From there, Platonism spread all over Europe. However, only in England did 
a truly Platonic school emerge (Cambridge). But Plato’s thoughts had lasting effects 
in the rationalistic systems of Cartesius, Spinoza, and Leibniz. Malbranche was even 
called the “Christian Plato.” In the nineteenth century, German idealism brought a 
revival of Plato’s system of thought. Hegel resorted not only to Platonism but even 
more to Plotinus and the New Platonism. Plato’s influence is seen in the recent past 
in the phenomenology of Husserl and in world philosophy. A.N. Whitehead explic¬ 
itly confessed to Platonism. Although his statement that all of European philosophy is 
only a footnote to Plato is exaggerated, it nevertheless rightly points out the immense 
influence of Plato’s philosophy. Platonism is even more dominant in the philosophy 
and theology of the Christian East, where the Platonic tradition of Origenes and of 
the Greek church fathers survives; for example, W. Solowjew and N. Berdjajew are 
Christian Platonics. [BR]. 

2 Aristotle, Greek philosopher, b. 384 BC, Stagira in Macedonia-d. 322, Chalcis on 
Euboea. He came at the age of 18 to Athens and became a student of Plato, where 
he remained at the Academy for almost two decades, first as a scholar and then as 
a teacher, finally opposing Plato with his own philosophy. After Plato’s death (347), 
he lived for three years in Asia Minor with Hermias, with the ruler of Atarneus. In 
343, he was called to the court of Phillip of Macedonia to be the tutor of Philip’s son 
Alexander. When Alexander ascended the throne, Aristotle returned to his hometown; 
however, in 334 he returned to Athens. In Athens, he founded the Peripataetic school, 
so called because of the covered walks (peripatoi ) surrounding the lyceum. He taught 
there for twelve years among an ever-increasing circle of scholars, until the revolt of 
Athens after Alexander’s death became dangerous to him, a friend of the royal dynasty. 
He went to his estate at Chalcis on Euboea, where he soon died. [BR]. 

3 Eudoxus of Cnidus (400-347 BC), Greek scientist. He was equally active as math¬ 
ematician, astronomer, and geographer. His biography is not recorded in detail, but 
it is considered certain that he was a member of the Platonic school. Later, he con¬ 
ducted his own school in Cyzicus. Eudoxus gave a new definition for proportion. He 
developed the exhaustive method and applied it to many geometric and stereometric 
problems and theorems, which he could prove exactly for the first time. Possibly, the 
major part of Euclid’s twelfth book is the work of Eudoxus. Eudoxus made a map of 
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the stars that remained top-ranking for centuries. He subdivided the sky into degrees 
of longitude and latitude, gave an improved value for the solar year, and improved the 
calendar. He estimated the earth’s circumference to be 400,000 stadia, edited a new 
map of the known continents, and wrote a geography of seven volumes. 

4 Archimedes, outstanding mathematician and mechanic of the Alexandrian era, 
b. about 285, Syracuse, killed by a Roman soldier during the capture of Syracuse. 
Archimedes was close to the Syracusean dynasty. He wrote important papers on math¬ 
ematics and mathematical physics, fourteen of which are preserved. He calculated the 
area and circumference of a circle, the area and volume of segments of the parabola, 
the ellipse, spiral, the rotation paraboloid, the one-shell hyperboloid, and others, and 
determined the center of gravity of these figures. For 7T, he gave a value between 
3 1/7 and 3 10/71; he developed in his “sand calculation” a method of exponential 
notation of arbitrarily large numbers, and in the Ephodos, a kind of integration cal¬ 
culus. Even more important than his treatment of the equilibrium conditions of the 
lever is the treatise on swimming bodies, where the principle of Archimedes is given. 
Archimedes determined the ratio of the volumes of the straight circular cone, the half¬ 
sphere, and the straight circular cylinder as 1: 2 : 3. Uncertain are the inventions of the 
water screw named after him and of the composed tackle; legendary is the burning of 
the Roman fleet by concave mirrors. [BR] 

5 Leonardo da Vinci, Italian painter, sculptor, architect, scientist, technician, b. April 
15, 1452, Vinci near Empoli-d. May 2, 1519, in the castle Cloux near Amboise; ille¬ 
gitimate son of Ser Piero, notary in Florence, and of a peasant girl. He was educated 
in his father’s house, and at the age of 15, he went to Florence as an apprentice of 
A. Verrocchio, who taught him not only painting and sculpting but also gave him an 
extensive education in the technical arts. In 1472, he was admitted to the Florentine 
guild of painters, but remained in Verrocchio’s studio. In this time, of common work 
the earliest of his preserved works emerged: an angel and the landscape in Verroc¬ 
chio’s painting of the baptism of Christ (Florence, Uffizi), two preachings (Uffizi and 
Louvre), and the Madonna with the vase (Munich, Pinakothek). About 1478, he be¬ 
came a freelancer and worked at Florence for about 5 more years. From this era stems 
the portrait of Ginevra Benci (Vaduz, gallery Liechtenstein), the unfinished painting 
of St. Jerome (Vatican), and the also unfinished great panel painting of the worship of 
the kings (Uffizi), which he got as an order for the high altar of a monastic church, but 
which he gave up half-finished when he left Florence at the end of 1481, to start work 
with Duke Lodovico of Milan. 

The end of the Sforza dynasty forced Leonardo to leave Milan (1499). Through 
Mantua, where he drew a portrait of margravine Isabella d’Este (Louvre), and Venice, 
where he drew up a defense plan against the threatening invasion of the Turks, he 
returned in April 1500 to Florence, where he began the painting of St. Anna Selb- 
dritt (Louvre). In May 1502, Leonardo started work as the first inspector of fortress 
buildings with Cesare Borgia, the papal military leader, throughout whose territory 
Leonardo traveled for about 10 months: the Romagna, Umbria, and parts of the 
Toscana. From this activity emerged a large fraction of his maps and city maps that— 
masterpieces of surveying and representation—belong to the earliest records of mod¬ 
ern cartography. Florence also asked for his advice as a war engineer; he worked on 
a plan to divert the Arno river, in order to cut off the main access road to Pisa with 
whom Florence was at war, and he designed the project for a channel to make the 
Amo navigable from the sea to Florence. Both of these plans were not realized, just 
like the draft proposed at the same time by sultan Bajasid II to built a bridge 300-m 
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long across the Bosporus. In 1503, Leonardo got the order to paint a monumental wall 
painting for the large senate hall of the Palazzo della Signoria in Florence; there, the 
drafts for the battle of Anghiari were born, which became the classical model in many 
copies, e.g., by Rubens for the cavalry-fight painting of Renaissance and Baroque, 
even to Delacroix. At the same time, Leonardo painted the Mona Lisa , presumably 
the world’s most famous painting, and the standing Leda (preserved only as copies). 
At this time, Leonardo reached the peak of his artistic fame. The arising geniuses of 
the young generation either admired him without envy (Raphael) or accepted him re¬ 
luctantly with jealousy (Michelangelo). Later, he painted only hesitantly, and more and 
more turned to scientific problems. Besides mathematical studies, he studied anatomy 
comprehensively. He dissected corpses and began an extensive treatise on the struc¬ 
ture of the human body, where he promoted the anatomical drawing accompanying 
the text as a tool for teaching. He also extended his biological and physical studies; 
the experiments on the flight of man—already begun in Milan—led him to investigate 
the flight of birds, which he also summarized like a treatise. Besides the laws of air 
flow, he also tried to investigate those of water flow. These studies contain approaches 
for theoretical and practical hydrology; he recorded them as materials for a treatise on 
water. In these years, he tried to arrange his notes by the main topics of his planned 
“books,” which as a whole comprise a theory of the mechanical primordial forces of 
nature, i.e., an entire cosmology. 

In 1506, Leonardo, at the request of the king of France released by the Florentine 
Signoria under the pressure of the political situation, stopped the work on the Anghiari 
battle and returned to Milan. There he served until 1513 mainly as an adviser to the 
French governor Charles d’Amboise, for whom he designed a large domicile and the 
plans for a chapel (S. Maria alia Fontana). From this era also date the drawings for 
the tomb of General Giangiacomo Trivulzio that—like the Sforza monument—was 
planned as an equestrian statue but was not realized. There is no clue to two almost 
finished Madonna paintings for His Most Christian Majesty. Also, in Milan Leonardo 
mainly dealt with scientific studies. He continued his great “anatomy,” in connection 
with the anatomist Marc Antonio Della Torre of Pavia, and he extended his hydrolog¬ 
ical and geophysical investigations both theoretically and practically, as is testified by 
his project for an Adda channel between Milan and the lake of Como, and by his amaz¬ 
ing geological observations on the origin of fossils. He also returned to his botanical 
studies; also in this field, he created exact demonstration drawings according to ex¬ 
actly defined principles of graphical representation, as in all of his research activities. 
He thereby founded the scientific illustration. 

When at the end of 1513 Leo X had risen to the papal throne, the now sixty-year- 
old Leonardo went to Rome, presumably with the hope of acquiring orders through 
his patron, Cardinal Giuliano de Medici. But he did not get big orders such as Raffael, 
Bramante, and Michelangelo got. His years in Rome were occupied by research, in 
particular on mechanics and anatomy. Only one painting, his last one, the mysterious 
John the Baptist (Paris, Louvre), may have originated in this period. 

In January 1517, Leonardo left Rome, following the invitation of Franz I. The coun¬ 
try castle Cloux near Amboise was allocated to him as a residence, and he got the title 
Premier peintre, architecte et mechanicien du Roi. He did not paint anymore, however, 
because of paralysis of his hand, but mainly arranged his scientific materials; in partic¬ 
ular, he worked on completing his “anatomy.” Among the few artistic creations of this 
last era, the project of a large castle and park for the residence of the Queen Mother 
in Romorantin is known. The building could not be built. His ideas nevertheless had 
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a lasting effect on the tremendous castle that was begun by Franz I when Leonardo 
was still alive, the building of Chambord. The most stirring documents of his late 
work are the drawings of the end of the world (Windsor), where Leonardo exhibited 
his experiences of life devoted to studying nature in a unique synthesis of scientific 
and artistic imagination; they are the symbol of the primordial forces penetrating the 
world, which once had created and finally shall destroy it, but even in self-destruction 
shall still obey the laws of harmony. 

Leonardo was buried in Amboise in the church of St. Florentin, which was de¬ 
stroyed during the French Revolution. His pupil and friend Francesco Melzi became 
heir to his enormous written work, which is almost completely done in mirror writing, 
familiar to him as a left-hander. 

The greatness of Leonardo and his significance in the history of occidental culture 
rests on the fact that he, like nobody else, understood art and science as a unity of hu¬ 
man will of perception and power of mental comprehension. As a painter, he was the 
first master of the classic style; his few artistic creations remained models of perfec¬ 
tion for all following eras and styles. As a researcher and philosopher, he is at the bor¬ 
derline between the Middle Ages and modem thinking. Altogether an empiricist, he 
tried to acquire an encyclopedic knowledge by means of experience and experiment. 
Guided by his imagination and less capable of abstract logical thinking, Leonardo 
must not—as was often tried—be considered as the founder of modern science as 
such. His achievements in the field of physics and pure mechanics are mediocre, often 
even questionable. But, since he performed his all-embracing observations on natural 
phenomena with ultimate objectivity and by virtue of his artistic talent was able to rep¬ 
resent them in drawings, he became the pioneer of a systematic descriptive approach 
in the natural sciences. Also, in the field of applied mechanics he can be considered 
as the founder of elementary engineering, for which he developed the graphical prin¬ 
ciples of demonstration. [BR] 

6 Isaac Newton , b. Jan. 4, 1643, Woolsthorpe (Lincolnshire)-d. March 31, 1727, 
London. Beginning in 1660, Newton studied at Trinity College in Cambridge, partic¬ 
ularly with the eminent mathematician and theologian I. Barrow. After getting various 
academic degrees and making a series of essential discoveries, in 1669 Newton be¬ 
came the successor of his teacher in Cambridge. From 1672, he was a member and, 
from 1703, president of the Royal Society. From 1688 to 1705, he was also a member 
of Parliament, from 1696, attendant, and from 1701, mint-master of the Royal mint. 
Newton’s life’s work, besides theological, alchemistic, and chronological-historical 
writings, mainly comprises works on optics and on pure and applied mathematics. 
In his investigations on optics, he described light as a flow of corpuscles and in this 
way interpreted the spectrum and the composition of light, as well as the Newton color 
rings, diffraction phenomena, and double-refraction. His main opus Philosophiae Nat- 
uralis Principia Mathematica (printed in 1687) is fundamental for the evolution of ex¬ 
act sciences. It includes the definition of the most important basic concepts of physics, 
the three axioms of mechanics of macroscopic bodies, e.g., the principle of actio et 
reactio , the gravitation law, the derivation of Kepler’s laws, and the first publication 
on fluxion calculus. Newton also dealt with potential theory and with the equilibrium 
figures of rotating liquids. The ideas for his big work mainly emerged in 1665 to 1666, 
when Newton left Cambridge because of the plague. 

In mathematics, Newton worked on the theory of series, e.g., in 1669, the bino¬ 
mial series, on interpolation theory, approximation methods, and the classification 
of cubic curves and conic sections. But Newton could not remove logical problems 
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even with his fluxion calculus that was represented in 1704 in detail. His influence on 
the further development of mathematical sciences can hardly be judged, since New¬ 
ton disliked publishing. For example, when Newton made his fluxion calculus public, 
his method was already obsolete compared with the calculus of Leibniz. The quarrel 
about whether he or Leibniz deserved priority for developing the infinitesimal calculus 
continued until the twentieth century. Detailed studies have shown that both of them 
obtained their results independently of each other. [BR] 

1 Joseph Louis Lagrange , mathematician, b. Jan. 25, 1736, Torino-d. April 10, 1812, 
Paris, at the age of 19 years, professor of mathematics in Torino. In 1766, he followed 
the call of Friedrich the Great to the Berlin academy of sciences. After Friedrich’s 
death, Lagrange moved to Paris as a professor at the Ecole Normale. He invented the 
principle named after him. Important for function theory is his Theorie des Fonctions 
Analytiques, Contenant les Principes du Calcul Differentiel (1789), and for algebra 
and number theory his Traite de la Resolution des Equations Numeriques des Tous 
Degres (1798). In the Mecanique analytique (1788), he generalized and condensed 
the principles of mechanics to the systems of equations named after him. [BR] 

8 Francis Bacon , English philosopher and politician, b. Jan. 22, 1561, London- 
d. April 9, 1626, London, son of Nicholas Bacon, nephew of Lord Burleigh; advocate 
and deputy. In the notorious trial of his patron Essex, Bacon convicted Essex of high 
treason. In 1607, he became Solicitor General; in 1613, Attorney General; in 1617, 
Keeper of the Great Seal; and, in 1618, Lord Chancellor. Ennobled as Baron Verulam 
and Viscount of St. Albans, in 1621 Bacon was thrown out by parliament because 
of passive corruption and was sentenced to high penalties and imprisonment, which, 
however, was remitted by the king’s influence. He was a curiously split character: 
outstandingly talented, vastly well read, vain, excessively ambitious, and of frighten¬ 
ing emotional frigidity. The reasons for his downfall were not only the proven and 
confessed failures, but equally the anger of the parliament about the egocentric and 
unauthorized policy of the king who utilized Bacon as a submissive tool. 

Bacon left a large number of philosophical, literary, and legal writings. His philo¬ 
sophical life’s work, the Instauratio Magna (i.e., great revival of philosophy), re¬ 
mained a fragment, an attempt (based on insufficient means) of a complete recon¬ 
struction of sciences on the basis of “unfalsified experience.” His main piece, Novum 
Organum (the title indicates the contraposition to Aristotle, whose logical writings 
traditionally were summarized under the title Organon ), is a method of scientific re¬ 
search, worked out down to the last detail, which shall serve to snatch the secrets of 
nature and to govern it (Bacon considered knowledge as the means for a purpose, 
“knowledge is power”). The starting point for any knowledge is experience. Expe¬ 
rience and mind should be tightly linked in a “legitimate marriage,” instead of the 
separation so far. Bacon constructed a complicated system of scientific induction, but 
he failed to appreciate the role of mathematics. His main piece is preceded by an in¬ 
ventory of all sciences {De diguitate et augmentis ), where—according to the three 
mental abilities memory, imaginative power, and mind—three main sciences are dis¬ 
tinguished: history, poetry, and philosophy. Bacon listed what had been achieved by 
each science and what still remained to be done. 

Among Bacon’s literary works, the essays suggested by Montaigne are timeless: 
10 in the first edition (1597), and 58 in the last edition (1625). In these “dispersed 
meditations,” Bacon presented practical life’s wisdom in the various fields, general 
guiding principles of the conduct of life, beyond good and evil, in an antithetical style 
of epigrammatic brevity, realistic and plain. “Nova Atlantis” is the perfect description 
of a philosophical ideal state. 
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Bacon’s legal writings testify to his absolute mastery of the subject. His plan of 
codifying the English law of his age was not completed. 

In the second half of the nineteenth century, Bacon was also considered to be the 
author of Shakespeare’s dramas (Bacon theory). [BR] 

9 Joachim Jungius , philosopher and scientist, b. Oct. 22, 1587, Liibeck-d. Sept. 17, 
1657, Hamburg, in 1609, professor in Giessen. In 1622, he founded in Rostock the first 
scientific society of Germany for the cultivation of mathematics and natural sciences. 
In 1624, he became a professor in Rostock; in 1625, in Helmstedt; and in 1628, head¬ 
master of the Johanneum and of the academic high school in Hamburg. He defended 
the principle “improvement of philosophy has to originate from physics (= natural 
sciences).” Jungius decisively contributed to the breakthrough of scientific chemistry 
and the renewal of atomism. He was also important as a botanist. [BR] 

10 Rene Descartes , b. March 31, 1596, La Haye-d. Feb. 11, 1650, Stockholm. 
Descartes was the son of a councilor of the parliament of Bretagne and was educated 
in a Jesuit college. He then began the study of law, and beginning in 1618, he partici¬ 
pated in various campaigns. Beginning in 1622, Descartes traveled in many countries 
of Europe, then settled down 1628 in the Netherlands, and lived from 1649 in Swe¬ 
den as a teacher of philosophy. The mathematical main achievement of Descartes is 
the foundation of analytical geometry in his Geometrie (1637), which also essentially 
influenced the further development of infinitesimal calculus. [BR] 

11 William Gilbert , English scientist and physician, b. May 24, 1544, Colchester- 
d. Nov. 30, 1603, London. Gilbert was from 1573 a practicing physician in London; 
from 1601, the private physician of Elizabeth I; and after her death, of King James 
I of England. In his fundamental work De magnete, magneticisque corporibus et de 
magnode magnets Tellure physiologia nova (London, 1600; facsimile edition, Berlin, 
1892; English translation and comment by S.R Thompson, in The collectors series in 
science , 1958), Gilbert summarized the knowledge of older authors to an impressing 
doctrine of magnetism and geomagnetism, and added a number of new observations 
and findings. The work, which in the second book also involves a special chapter 
on corpora electrica , on substances that—like amber (electrum)—after rubbing are 
capable of attracting light bodies, impressed several of his contemporaries, among 
others Kepler and Galileo. His treatise De monde nostro sublunari philosophia nova 
appeared posthumous (Amsterdam, 1561). [BR] 

12 Johannes Kepler , b. Dec. 27, 1571, Weil der Stadt-d. Nov. 15, 1630, Regensburg. 
Kepler was the son of a trader who also often served in the military. He first went 
to school in Leonberg, and later to the monastic school in Adelberg and Maulbronn. 
From 1589, Kepler studied in Tubingen to become a theologian, but in 1599, he took 
the position of professor of mathematics in Graz that was offered to him. In 1600, 
because of the Counter-Reformation Kepler had to leave Graz and went to Prague. 
After the death of Tycho Brahe (Oct. 24, 1601), as his successor Kepler became the 
imperial mathematician. After the death of his patron, Emperor Rudolf II, Kepler left 
Prague and in 1613 went to Linz as a land surveyor. From 1628, Kepler lived as an 
employee of the powerful Wallenstein, mostly in Sagan. Kepler died unexpectedly 
during a visit to the meeting of electors in Regensburg. 

Kepler’s main fields were astronomy and optics. After extraordinarily lengthy cal¬ 
culations, he found the fundamental laws of planetary motion: the first and second of 
Kepler’s laws were published in 1609 in Astronomia Nova , and the third one in 1619 
in Harmonices Mundi. In 1611, he invented the astronomical telescope. His Rudol- 
phian tables (1627) continued to be one of the most important tools of astronomy until 
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the modem age. In the field of mathematics, he developed the heuristic infinitesimal 
considerations. His best-known mathematical writing is the Stereometria Doliorum 
(1615), where, e.g., Kepler’s barrel rule is given. 

l3 Galileo Galilei , Italian mathematician, b. Feb. 15, 1564, Pisa-d. Jan. 8, 1642, 
Arcetri near Florence, studied in Pisa. At the Florentine Accademia del Dissegno, 
he got access to the writings of Archimedes. On the recommendation of his patron 
Guidobaldo del Monte, in 1589 he received a professorship for mathematics in Pisa. 
Whether or not he performed fall experiments at the leaning tower is not proven incon¬ 
testably; in any case, the experiments had to prove his false theory. In 1592, Galileo 
took a professorship of mathematics in Padua, not because of disagreements with col¬ 
leagues but for a better salary. He invented a proportional pair of compasses, furnished 
a precision mechanic workshop in his home, found the laws for the string pendulum, 
and derived the laws of falling bodies first in 1604 from false assumptions and then 
in 1609 from correct assumptions. Galileo copied the telescope invented one year ear¬ 
lier in the Netherlands, used it for astronomical observations, and published the first 
results in 1610 in his Nuncius Siderus , the “star message.” Galileo discovered the 
mountainous nature of the Moon, the abundance of stars of the Milky Way, the phases 
of Venus, the moons of Jupiter (Jan. 7, 1610), and in 1611 the sunspots, although for 
these Johannes Fabricius preceded him. 

Only beginning in 1610 did Galileo, who returned to Florence as Court’s mathe¬ 
matician and philosopher to the grand duke, publicly support the Copernican system. 
By his over-eagerness in the following years, he provoked in 1614 the ban of this doc¬ 
trine by the pope. He was urged not to advocate it further by speech or in writing. 
During a dispute on the nature of the comets of 1618, where Galileo was completely 
right, he wrote as one of his most profound treatises the Saggiatore (inspector with the 
gold balance, 1623), a paper dedicated to Pope Urban VIII. Since the former cardinal 
Maffeo Barberini had been well disposed toward him, Galileo hoped to win him as 
pope for accepting the Copernican doctrine. He wrote his Dialogo , the “Talk on the 
Two Main World Systems,” the Ptolemyan and the Copernican, gave the manuscript 
in Rome for examination, and published it 1632 in Florence. Since he obviously had 
not included the agreed-upon changes of the text thoroughly enough and had shown 
his sympathy with Copernicus too clearly, a trial set up against Galileo ended with his 
renunciation and condemnation on June 22,1633. Galileo was imprisoned in the build¬ 
ing of inquisition for a few days. The statement “It (the earth) still moves” (. Eppur si 
mouve) is legendary. Galileo was sentenced to unrestricted arrest, which he spent with 
short breaks in his country house at Arcetri near Florence. There, he also wrote a work 
important for the further development of physics: the Discorsi e Dimonstrazioni math- 
ematiche , the “conversations and proofs” on two new branches of science: mechanics 
(i.e., the strength of materials) and the science branches concerning local motions 
(falling and throwing) (Leiden 1638). 

In older representations of Galileo’s life, there are many exaggerations and mis¬ 
takes. Galileo is not the creator of the experimental method, which he utilizes no 
more than many other of his contemporaries, although sometimes more critically than 
the competent Athanasius Kircher. Galileo was not an astronomer in the true sense, 
but a good observer; and as an excellent speaker and writer, he won friends and pa¬ 
trons for a growing new science and its methods among the educated of his age, and 
he stimulated further research. Riccioli and Grimaldi in Bologna confirmed Galileo’s 
laws of free fall by experiment. His scholars Torricelli and Viviani developed one of 
Galileo’s experiments—for disproving the “horror vacui”—to the barometric experi- 
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ment. Christian Huygens developed his pendulum clock based on Galileo’s ideas, and 
he transformed Galileo’s kinematics to a real dynamics. 

Galileo was one of the first Italians who used their native language for presentation 
of scientific problems. He defended this point of view in his correspondence. His prose 
takes a special position within the Italian literature, since it is distinguished by its 
masterly clarity and simplicity from the prevailing bombast that Galileo had reproved 
in his literary-critical essays on Taso et al. In his works II Dialogo sopra i due massimi 
sistemi (Florence 1632) and I Dialoghi delle nouve scienze (Leiden 1638), he utilized 
the form of dialogue that came down from the Italian humanists, to be understood by 
a broad audience. [BR] 

14 Christian Huygens , Dutch physicist and mathematician, b. April 14, 1629, Den 
Haag-d. July 8, 1695, Den Haag. After initially studying law, he turned to mathemati¬ 
cal research and published among other things in 1657 a treatise on probability calcu¬ 
lus. At the same time, he invented the pendulum clock. In March 1655, he discovered 
the first moon of Saturn, and in 1656, the Orion nebula and the shape of Saturn’s ring. 
By then, he was already familiar with the laws of collision and of central motion, but 
published them-without proof—only in 1669. In 1663, Huygens was elected a mem¬ 
ber of the Royal Society. In 1665, he settled in Paris as a member of the newly founded 
French academy of sciences, from where he returned in 1681 to the Netherlands. After 
publishing in 1657 the small treatise Horologium and in 1659 his Systerna Saturnium, 
sive de causis mirandorum Saturni phaenomeno , in 1673 emerged his main work: 
Horologium oscillatorium (the pendulum clock), which besides the description of an 
improved watch construction contains a theory of the physical pendulum. Further one 
finds treatises on the cycloid as an isochrone, and important theorems on central mo¬ 
tion and centrifugal force. From 1675 dates Huygens’s invention of the spring watch 
with a balance spring, from 1690 the Tractatus de lumine (treatise on light), which 
contained a first version of the wave theory (collision theory) of light, and based on 
that, the theory of double refraction of Iceland spar is developed. The spherical prop¬ 
agation of action around the light source is explained there by means of Huygens’ 
principle. [BR] 

15 Copernicus, Coppernicus, German Koppernigk, Polish Kopernik, Nikolaus, as¬ 
tronomer, and founder of the heliocentric world system, b. Feb. 19, 1473, Thom-d. 
May 24, 1543, Frauenburg (East Prussia). Beginning in 1491, he engaged in human¬ 
istic, mathematical, and astronomical studies at the university in Cracow. From 1496 
to 1500, he studied civil and clerical law in Bologna. At the instigations of his un¬ 
cle, bishop Lukas Watzelrode, in 1497 he was admitted to the chapter of Ermland at 
Frauenburg, but he took only the lower holy orders. In Bologna, he continued his as¬ 
tronomical work together with the professor of astronomy Dominico Maria Novarra, 
made a short stay in Rome, and in 1501 temporarily returned to Ermland. Beginning 
in the autumn of 1501, he studied in Padua and Ferrara, graduating on May 31, 1503, 
as a doctor of canonical law, and then studied medicine. After returning home in 1506, 
he lived in Heilsberg as secretary to his uncle from 1506 until his death in 1512. He 
was involved in administrating the diocese of Ermland, and he accompanied his uncle 
to the Prussian state parliaments and the Polish imperial parliament. As chancellor of 
the chapter, Copernicus after 1512 lived mostly in Frauenburg, resided as governor of 
the chapter (1512-1521) in Mehlsack and Allenstein, and in 1523 was administrator 
of the diocese of Ermland. As a deputy, he represented the order chapter (1522-1529) 
at the Prussian state parliaments and there particularly supported monetary reform. 
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Contrary to Polish claims, Copernicus’s German origin is established (the paternal 
family stems from the diocesan country Neiss in Silesia). Like his elder brother An¬ 
dreas, he defended the concerns of Ermland against the Crown of Poland; both in writ¬ 
ing and in oral speech, he utilized only German and Latin. Besides his administrative 
work, he also practiced as a physician. Toward the end of his life, in 1537, Copernicus 
had differences with Johannes Dantiscus, the newly elected bishop of Ermland. 

As an astronomer, Copernicus completed what Regiomontan had imagined: a revi¬ 
sion of the doctrine of planetary motion, taking into account a series of critically evalu¬ 
ated observations. Only on such a basis could one then think of reforming the calendar. 
The urgency of that reform was generally recognized at the beginning of the sixteenth 
century. Copernicus presumably was influenced by these considerations. In the course 
of his work, he then decided to accept a heliocentric world system, inspired by ancient 
writings. A brief, preliminary report on this topic is the Commentariolus , presumably 
written before 1514. Here, already the decisive assumptions are expressed: The sun is 
in the center of the planetary orbit—still considered as circular—and the earth circu¬ 
lates about the sun; the earth daily rotates about its axis and in turn is orbited by the 
moon. The wider public got the information on the Copernican doctrine only by the 
Narratio prima of Georg Joachim Rheticus (first report on the six books of Copernicus 
on the circular motions of celestial paths, 1540, German, by K. Zeller, 1943). 

The main work of Copernicus, the “Six Books on the Orbits of Celestial Bodies” 
(Die revolutionibus orbium coelestium libri VI , 1543; German, 1879; new edition, 
1939), was published in the year of death of the author. It was dedicated to Pope 
Paul III, but instead of the original foreword of Copernicus, it was introduced with a 
foreword by the Protestant theologian Andreas Oslanden that inverted the meaning of 
the whole subject. The doctrines of Copernicus remained uncontested by the church 
until the edict of the index congregation of 1616. The imperfections of the Copernican 
theory of planets were removed by Johannes Kepler. [BR] 

16 Giovanni Alfonso Borelli , b. 1608, Naples-d. Dec. 31, 1679, Rome, physicist and 
physiologist. In 1649, Borelli became a professor of mathematics in Messina and in 
1656 moved to Pisa. In 1667, he returned to Messina. In 1674, he was forced to leave 
to Rome. There he lived until his death under the patronage of Christina, Queen of 
Sweden. His best-known work, De motu animalium, deals with the motions of the 
body of animals, which he traced back to mechanical principles. In a letter published 
in 1665 under the pseudonym Pier Maria Mutoli, Borelli first expresses the idea of 
a parabolic trajectory of comets. Among his numerous astronomical works, there is 
also Theoretica mediceorum planetarum ex causis physicis deducta (Florence, 1666), 
treating the influence of the attracting force of Jupiter’s moons on the orbital motion 
of Jupiter. 

11 Giovanni Battista Benedetti , b. Aug. 14, 1530-d. Jan. 20, 1590, Torino, first to 
recognize the buoyancy action of the surrounding medium in free fall. Taking up the 
ideas of Archimedes, he writes in his work De resolutione omnium Euclidis problema- 
tum (Venice 1553) that the fall velocity shah be determined by the difference of the 
specific weights of the falling body and the medium. 

18 Robert Hooke , English researcher, b. July 18, 1635, Freshwater (Isle of Wight)- 
d. March 3, 1703, London. Hooke was at first an assistant to R. Boyle; then from 1665 
a professor of geometry at Gresham College in London; and from 1677 to 1682, secre¬ 
tary of the Royal Society. Hooke improved already-known methods and devices, e.g., 
the pneumatic pump and the composite microscope (described in his Mikographia , 
1664). Hook was often involved in questions on priority, e.g., with Huygens, Hevelius, 
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and Newton. He proposed, among others, the melting point of ice as the zero point of 
the thermometric scale (1664), recognized the constancy of the melting and boiling 
point of substances (1668), and for the first time observed the black spots on soap 
bubbles. He gave a conceptually good definition of elasticity and in 1679 established 
Hooke’s law. [BR] 

19 Plutarch (Greek Plutarchos), Greek philosopher and historian, b. about AD 50, 
Charonea, from an old bourgeois family-d. about 125. He was educated in 66 in 
Athens by the academician Ammonios to be a follower of Plato’s philosophy. He 
visited, among other cities, Alexandria and Rome, where he had contact with promi¬ 
nent Romans, but lived permanently in his small hometown. There, he participated in 
communal politics; in Delphi he became a priest about 95. Plutarch was honored by 
the emperors Trajan and Hadrian. However, his life center remained in the vicinity of 
his homeland, where he closely associated with family and a circle of friends. This 
milieu feeds the ethos of education and the national pathos of his numerous writings. 
Despite the wealth of topics and the abundance of material, they nevertheless display 
an inner unity from the pleasant integrity of a personality formed by philosophy and 
religion. 

Plutarch’s works consist of two groups. The first group contains the biographies 
C Vitae parallelae ), 46 comparative life descriptions of famous Greeks and Romans 
(e.g., Pyrrhos-Marius, Agesilaos-Pompeius, Alexander-Cesar). Great Romans were 
presented to the Greeks, in parallel and in contrast to their own history. Thereby an 
important step toward inner balance of the double-culture of the Greek-Roman era of 
emperors was made. The literary appeal of the biographies rests on the vivid represen¬ 
tation and the description of characters, supported by memorable anecdotal features. 

The second group, Moralia , contains popular, ethical-educational writings, but also 
strictly philosophical, metaphysical, religious-philosophical investigations, learned 
antiquarian studies, political treatises, etc. This writing was borne by the tradition 
of the Platonic school, without schoolmasterly narrowness, with vivid religious em¬ 
phasis. 

While Plutarch was read in the Byzantine empire, he was unknown to the West in 
the Middle Ages. In the early fifteenth century (Guarino and his scholars, then Pier 
Decembrio, L. Bruni), Plutarch’s works were translated to Latin; from 1559 to 1572, 
by J. Amyot in classical form into French (fertilizing influence on the dramatic art in 
France in the seventeenth century); and in 1579, by North into English (influence on 
Shakespeare). In Germany, for a long time Plutarch was appreciated only by learned 
circles. Only toward the end of the eighteenth century was the interest turned again to 
him (Schiller, Goethe, Jean Paul, Beethoven, and Nietzsche). [BR] 


Recommendations for Further Reading on Theoretical Mechanics 

The textbooks on theoretical mechanics listed below represent only part of the wealth 
of excellent literature on this topic. 

Classical textbooks on mechanics: 

H. Goldstein: Classical Mechanics , 3rd edition (2001), Addison-Wesley Pub. Co. 
A. Sommerfeld: Mechanics (Lectures on Theoretical Physics , Vol. 1), 4th edition 
(1964), Academic Press. 
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L. D. Landau and E.M. Lifschitz: Mechanics , 3rd edition (1982), Butterworth- 
Heinemann. 

Problems and exercises for classical mechanics: 

M. R. Spiegel: Theory and Problems of Theoretical Mechanics (Schaum’s Outline 
Series), Si-edition (1980), McGraw-Hill. 

More mathematical presentations of mechanics: 

F. Scheck: Mechanics: From Newton's Laws to Deterministic Chaos , 3rd edition 
(1999), Springer. 

J.B. Marion and S.T. Thornton: Classical Dynamics of Particles and Systems , 4th 
edition (1995), Saunders College Publishing. 

We consider the work of H. Goldstein to be particularly suited as an addendum. 
Starting from the elementary principles, he outlines the formal Hamilton-Jacobi the¬ 
ory in a didactically brilliant manner. All typical applications (central force problem, 
rigid body, vibrations etc.) are discussed and expanded on in exercises and by special 
recommendations for further reading. The lectures by A. Sommerfeld, planned in a 
similar way, represent a gold mine because of the treatment of many special problems 
and the imaginative power demonstrated in the mathematical solution techniques. 

Readers may gain an appreciation of the formal aesthetics of the volume Mechanics 
from the textbook by Landau and Lifschitz. 
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air resistance, 317 
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approximation, successive, 12 
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baker transformation, 509, 515 
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- time-dependent, 499 
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canonical transformation, 365 
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Cardano formula, 113 
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center of gravity, 43 

- circular cone, 47 
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center-of-mass coordinates, 70 
central field, scattering in, 50 
central forces, 66 

central motion, 331 
centrifugal force governor, 322 
centripetal acceleration, 7, 9 
chain, vibrating, 88 
Chandler period, 218 
chaos, 461 

chaotic attractor, 468, 507 
characteristic equation, 189 
charged particle, 314 
Chasles’ theorem, 41 
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circular mapping 

- dissipative, 532 
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cluster property, 44 
collision parameter, 52 
conditionally periodic, 154 
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- generalized, 303 
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- holonomic, 259 

- nonholonomic, 259, 301 
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continuum, 39 
contracting flow, 466 
control parameter, 463 
cooling 

- of particle beam, 359 

- stochastic, 355 
coordinates 

- generalized, 259, 262, 463 

- ignorable (cyclic), 277 
coordinate system 
-body-fixed, 186 

- center-of-mass, 53, 70 

- for rotating earth, 9 

- rotating, 3, 7 

Coriolis acceleration, 7, 36 

Coulomb scattering, 56 

coupled mass points, vibrations, 81 

coupled pendulums, 85 

couple of forces, 161 

critical point, 469 

cross section 

- differential, 51 

- Rutherford, 55 
cycloid, 131, 292 
cyclone, 34 

D’Alembert principle, 267 
deformable medium, 39 
degeneracy, 137 
degrees of freedom, 41 
determinant of coefficients, 90 
deterministic, 463, 504 
deviation moments, 163, 186 
devil’s staircase, 535 
differential cross section, 51 
differential principles, 337 
dimension 

- broken, 508, 509 

- capacity, 511 

- fractal, 511 

- Hausdorff, 514 

- information-, 514, 543 

- Lyapunov, 543 
Dirichlet conditions, 121 
discrete systems, 517 
discretization, 487, 531 
dispersion law, 106 
displacement, virtual, 267 
dissipation, 464-466 
dissipation function, 315 
dissipative circular mapping, 532 
double pendulum, 289 
dynamical system, 463 

earth, nutation of, 217 
eastward deflection, 12, 16 
eclipse, 230 
ecliptic, 226 

eigenfrequency, 81, 82, 137 
eigenmodes, orthonormal, 152 
eigenvalue, 189 


eigenvector, 189 
eigenvibrations, 82 
electromagnetic field, 314 
ellipsoid, rotating, 218 
ellipsoid of inertia, 196 

- quadratic disk, 202 

- regular polyhedron, 218 
energy law, 68 

equation, characteristic, 189 
equilibrium solution, 469 
ergodic system, 505, 530 
Euler angles, 238 
Euler equations, 214 
Euler-Lagrange equation, 341 
extended canonical equations, 419 
extended canonical transformations, 416,428 
extended Euler-Lagrange equation, 415 
extended Hamilton-Jacobi equation, 455 
extended point transformations, 433 
extended Poisson brackets, 434 
externally excited system, 486 

fall machine, Atwoods, 75 
Feigenbaum constant, 522 
figure axis, 209 
fixed point, 469, 507, 517 

- unstable, 470 

Floquet multiplier, 490, 493 
Floquet’s theory of stability, 489 
flow of a vector field, 464 
force 

- generalized, 265 

- Lorentz, 311 

- nonconservative, 315 
Foucault’s pendulum, 23 
Fourier coefficients, 122 
Fourier series, 121 
fractal, 508 

fractal dimension, 511 
free fall, 9 

frequency spectrum, (an)harmonic, 137 

friction force, 206 

friction parameter, 538, 541, 544 

friction tensor, 316 

function 

- even (odd), 123 

- homogeneous, 330 
fundamental harmonic, 137 

galaxy, flattening of, 67 
generalized Noether theorem, 447 
generating function, 366 
gravitational acceleration, 9 
Green’s function 

- advanced, 3 

- free, 5 

gyrocompass, 229 
gyroscope, 228 

Hamilton action function, 383 
Hamilton equations, 329, 349 
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Hamilton principle, 337 

Hamiltonian, 328 

Hamiltonian mechanics, 463 

Hamilton-Jacobi differential equation, 383 

Hamilton-Jacobi theory, 383 

harmonic oscillator 

- canonical formalism, 370 

Hausdorff dimension, 514 

herpolhodie, 212 

hockey puck, 176 

holonomic, 259, 279, 317 

Hopf branching, 498 

hydrogen atom, 408 

Hyperion, 544 

inertial system, 3 
information dimension, 514, 543 
integrability condition, 260 
integral principles, 337 
iteration method, 13 
iterative mapping, 517 

Jacobi matrix, 469, 492 

Kaplan-Yorke relation, 543 


mass, reduced, 72 
mass density, 45 
membrane 

- circular, 141 
-rectangular, 135 
-vibrating, 133 
Mercury, 549 

mode locking, 533, 536 
molecule 

- asymmetric linear, 289 

- three-atom, 284 

- triangular, 286 
momentary center, 48 
moment of inertia, 162, 545 

- circular cylinder, 163 

- cube, 167 

- rectangular disk, 165 

- rigid bodies, 174 

- sphere, 167 
momentum 

- generalized, 276 

- linear, 65 

momentum space, 350 
monodromy matrix, 490 
motion, central, 331 


Kepler problem, 389 
Kepler system 

- canonical formalism, 380 
Kepler’s law, 546 
kinetic energy, 72 

- of a rotating rigid body, 187 
Koch curve, 510, 512 


neutron star, vibrating, 220 
Newton’s equations 

- arbitrary relative motion, 7 

- rotating coordinate system, 3, 7 
nodal line, 138 

node, 97, 471, 496 
nonconservative forces, 315 


laboratory system, 3 
Lagrange equations, 276 
- nonholonomic constraints, 301 
Lagrange multipliers, 301 
Lagrangian, 275 

Legendre transformation, 327, 369 
libration, 539 

limit cycle, 475, 477, 483, 492, 500, 507 

linearization, 469 

Liouville theorem, 352, 376, 465 


nonholonomic, 301, 317 
nonlinear dynamics, 461 
normal frequencies, 84, 289 
normal vibrations, 82, 105 
North Pole 

- geometrical, 218 

- kinematic al, 218 
nutation, 212 
nutation cone, 212 
nutation of earth, 217 


Lippmann-Schwinger equation, 5 
Lissajous figure, 155 
logistic mapping, 519, 523, 527 
Lorentz force, 311 
Lorenz, 505 

Lyapunov dimension, 543 
Lyapunov exponent, 504, 508, 551 

- logistic mapping, 525, 530 

- maximum, 506, 541 


one-dimensional mapping, 503 
orbit, 464 

orthogonality relation 

- for trigonometric functions, 122 
oscillator 

- nonlinear, 473 

- Rayleigh, 476 

- van der Pol, 476, 480 
overtone, 107, 137, 149 


Mandelbrot, 508 
many-body system, 65 
mapping 

- iterative, 517 
-logistic, 519, 523,527 

- one-dimensional, 503, 518 

- Poincare, 488, 503, 517, 537 

- stroboscopic, 487 


parabola, Neil, 400 
paraboloid, 305 
path stability, 485 
Peano curve, 513 
pendulum 

- cone, 30 

- coupled, 85 

- double, 289 
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- Foucault, 23 

- periodically driven, 537 

- physical, 166 

- plane, 351 

- rolling, 169 

- string, 294 

- theories, 332 

- upright, 281, 282 
pendulum length, reduced, 169 
pendulum watch, 534 
period, Chandler, 218 
period-doubling, 501, 520, 539 
periodic attractor, 533 
periodic solution, 486 
perturbation calculation, 11 
phase diagram, 351 

phase flow, 464, 467 
phase integral, 388 
phase space, 350 
phase-space density, 354 
phase trajectory, 350 
phase velocity, 106 
physical pendulum, 166 
pirouette, 67 

pitchfork branching, 497, 520 

Pivot forces, 222 

Planck’s constant, 408 

plane, invariable, 210 

Poincare, 461, 504 

Poincare-Bendixson theorem, 480 

Poincare cut, 487, 542, 549 

Poincare mapping, 488, 503, 517, 537 

Poincare recurrence time, 150 

Poinsot ellipsoid, 210 

point attractor, 518 

point transformation, 370 

Poisson bracket, 378, 410 

pole cone, 212 

pole curve, 48 

pole path, 48 

pole trajectory, 212 

polhodie, 212 

potential 

- generalized, 312 

- scalar, 314 

- vector, 314 

- velocity-dependent, 311 
precession, 225 

- stationary, 246 
precession velocity, 247 
principal axes of inertia, 188 
principal axis, 203 
Principle of causality, 5 
projectile, 51, 317 

quantum hypothesis, 407 
quantum number, 105 
quasiperiodic, 533, 535, 549 

Rayleigh oscillator, 476 
recurrence time, Poincare, 150 


reduced mass, 72 
relaxation vibration, 479, 482 
resonances, 548, 549 
rheonomic, 259, 279 
river, superelevation of bank, 18 
rolling pendulum, 169 
rosette path, 28 

rotating coordinate system, 3, 7 
rotating ellipsoid, 218 
rotating tube, 35 
rotation, 41 

rotation about a fixed axis, 161 
rotation about a point, 185 
rotational velocity, 7 
rotation energy, 188 
rotation matrix, 194, 240 
rotator, periodically kicked, 530 
rotor, 471 

Runge-Lenz vector, 382 
Rutherford scattering, 55 

saddle-node branching, 496 
saddle point, 471, 496 
Saros cycle, 230 
Saturn, 544 

scattering, in a central field, 50 
scattering cross section 

- Rutherford, 55 

- square well potential, 59 
scattering experiment, 51 
scattering of two atoms, 63 
scleronomic, 259 

sea level, 18 

secular equation, 470 

self-exciting vibration, 479 

self-similarity, 510, 525, 527, 535 

Sierpinski gasket, 511 

similarity transformation, 195 

slant throw, 395 

sleeping top, 247 

solar system, 76 

sound velocity, 106 

spin, 243 

spiral, 471, 473 

stability, 469 

- asymptotic, 470, 485 

- orbital, 485 

- time-dependent paths, 485, 488 
stability of paths, 488 
staggering motion, 544, 551 
standard mapping, 532 
Steiner’s theorem, 164 
stochastic cooling, 355 
straight line, invariable, 210 
strange attractor, 468, 509, 542 
stretching and folding, 508, 528 
string, vibrating, 101, 108 
string pendulum, 294 

string tension, 24, 101, 261, 268 
stroboscopic mapping, 487 
subcritical branching, 498 
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subharmonic bifurcation, 501 
subharmonic cascade, 540 
supercritical branching, 498 
superposition principle, 105 
superstability, 524, 526, 535 
surface density, 46 
symmetry axis, 203 
symmetry-breaking bifurcation, 541 
synchronization, 533 
system of mass points, 65 
system of principal axes, 188 

target, 51 

tautochrone problem, 129 

tensor, 194 

tensor of inertia, 186 

- square, 191 

- three mass points, 204 
theorem of Chasles, 41 
theorem of Liouville, 350 
theorem of Steiner, 164 
theory of chaos, 505 

theory of stability, Floquet, 489 
theory of top 

- analytical, 213 

- geometrical, 210 
tidal forces, 230 
tidal friction, 547, 549 
top 

- asymmetric, 255, 297 

- free, 209 

- heavy, 224, 249 

- oblate, 209 

- prolate, 209 

- rolling circular, 199 

- sleeping, 247 

- spherical, 191, 209 

- symmetric, 191 
top moment, 226 
torque, 161 

- elliptic disk, 223 

- of rotating plate, 219 
torus, 507 


total time derivative, 412 
trace cone, 212 
trace trajectory, 212 
trajectory, 464, 538 
transcritical branching, 498 
transformation 

- canonical, 365 

- of kinetic energy, 72 

- to center-of-mass coordinates, 70 
transients, 467, 525 
translation, 41 

transverse cut, 487 
tube, rotating, 279 
turbulence, 461 

van der Pol oscillator, 476, 480 
variational problem, 338, 340 
vector field, 463 
vector product, 4 
velocity 

- angular, 6 

- generalized, 264 

- true, 7 

- virtual, 7 

velocity-dependent potential, 311 
velocity field, 463, 466 
vibrating chain, 88 
vibrating membrane, 133 
vibrating string, 101, 108 
vibration antinode, 97 
vibration, self-exciting, 479 
vibrations of coupled mass points, 81 
virtual displacement, 267, 556 
virtual forces, 7 

virtual work, 267, 269, 319, 556 
volume density, 43 
Voyager, 544, 551 

wave equation, 102 

wavelength, 105 

wave number, 105 

winding number, 533 

work, virtual, 267, 269, 319, 556 



